<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data on AI and Society Course</title><link>https://msucerl.org/cmse101/tags/data/</link><description>Recent content in Data on AI and Society Course</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Tue, 05 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://msucerl.org/cmse101/tags/data/index.xml" rel="self" type="application/rss+xml"/><item><title>Week 2 Assignment: Data Analysis Exercise</title><link>https://msucerl.org/cmse101/assignments/week-02/</link><pubDate>Tue, 05 May 2026 00:00:00 +0000</pubDate><guid>https://msucerl.org/cmse101/assignments/week-02/</guid><description>&lt;h2 id="week-2-assignment-data-analysis-exercise"&gt;Week 2 Assignment: Data Analysis Exercise&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Due:&lt;/strong&gt; End of Week 2 | &lt;strong&gt;Format:&lt;/strong&gt; Short report + visualization | &lt;strong&gt;Length:&lt;/strong&gt; 400-600 words&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="-assignment-overview"&gt;📝 Assignment Overview&lt;/h2&gt;
&lt;p&gt;You&amp;rsquo;ll select a real dataset and analyze it through the lens of what you learned about data quality, representation, and how training decisions affect AI systems.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="-instructions"&gt;📋 Instructions&lt;/h2&gt;
&lt;h3 id="part-1-dataset-selection--exploration"&gt;Part 1: Dataset Selection &amp;amp; Exploration&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;Find a dataset that interests you (suggestions: Kaggle, UCI ML Repository, Google Dataset Search, or your field&amp;rsquo;s data repository)&lt;/li&gt;
&lt;li&gt;Download or access the dataset&lt;/li&gt;
&lt;li&gt;Document:
&lt;ul&gt;
&lt;li&gt;What does the dataset contain?&lt;/li&gt;
&lt;li&gt;How many records or observations does it have?&lt;/li&gt;
&lt;li&gt;What features or variables does it include?&lt;/li&gt;
&lt;li&gt;Who collected it and why?&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="part-2-critical-analysis-400-600-words"&gt;Part 2: Critical Analysis (400-600 words)&lt;/h3&gt;
&lt;p&gt;Write a report addressing:&lt;/p&gt;</description></item><item><title>Week 2: AI Systems &amp; Data</title><link>https://msucerl.org/cmse101/readings/week-02/</link><pubDate>Tue, 05 May 2026 00:00:00 +0000</pubDate><guid>https://msucerl.org/cmse101/readings/week-02/</guid><description>&lt;h2 id="week-2-ai-systems--data"&gt;Week 2: AI Systems &amp;amp; Data&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Focus:&lt;/strong&gt; How AI systems work; data and training&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="-required-readings"&gt;📚 Required Readings&lt;/h2&gt;
&lt;h3 id="primary-readings"&gt;Primary Readings&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&amp;ldquo;How Machine Learning Works: The Training Process&amp;rdquo;&lt;/strong&gt; (25 min)&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Training data, features, and labels&lt;/li&gt;
&lt;li&gt;Algorithms and parameter optimization&lt;/li&gt;
&lt;li&gt;Testing and validation&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;&amp;ldquo;Data: The Fuel of AI&amp;rdquo;&lt;/strong&gt; (20 min)&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Data collection and preparation&lt;/li&gt;
&lt;li&gt;Data quality and representation&lt;/li&gt;
&lt;li&gt;The role of big data in modern AI&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="supplementary-resources"&gt;Supplementary Resources&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&amp;ldquo;The Hidden Technical Debt in Machine Learning Systems&amp;rdquo; — Selected sections&lt;/li&gt;
&lt;li&gt;Interactive visualizations: Neural network playground&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 id="-discussion-prompts"&gt;💭 Discussion Prompts&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;Why is data quality often more important than data quantity?&lt;/li&gt;
&lt;li&gt;How do training decisions affect AI system behavior in the real world?&lt;/li&gt;
&lt;li&gt;What role does feedback play in AI systems over time?&lt;/li&gt;
&lt;/ol&gt;
&lt;hr&gt;
&lt;h2 id="-preparation-for-class"&gt;📝 Preparation for Class&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Find a dataset related to a topic you care about (e.g., on Kaggle or the UCI Machine Learning Repository)&lt;/li&gt;
&lt;li&gt;Reflect on what patterns you&amp;rsquo;d hope an AI system would find in that data&lt;/li&gt;
&lt;li&gt;Consider: What could go wrong if the data were biased or incomplete?&lt;/li&gt;
&lt;/ul&gt;
&lt;hr&gt;
&lt;h2 id="-related-assignment"&gt;🔗 Related Assignment&lt;/h2&gt;
&lt;p&gt;See &lt;a href="https://msucerl.org/cmse101/assignments/week-02/"&gt;Week 2 Assignment&lt;/a&gt; for this week&amp;rsquo;s task.&lt;/p&gt;</description></item></channel></rss>