Fall 2022 Data Science Insights Award Winner - Quantifying the Commons
Students: Dun-Ming Huang and Shuran Yang
The Data Insights Award recognizes the team with the best data science insights. One of the overarching goals of data science is to interpret the results of statistical and machine learning models and implement solutions as necessary. Here, we recognize a team that has bulletproof insights based on their methods. We define an insight as anything that increases or improves the understanding of a trend, system, or phenomenon and how it arises. Therefore, this award is based not only on the results of the project but also on how these results are contextualized within the field.
Dun-Ming Huang and Shuran Yang collaborated with the Creative Commons team on Quantifying the Commons, in which the main objective was to quantify the size and diversity of the Commons.
Previously, the Commons suffered from unreliable data retrieval methods that were slow and inefficient. The student researchers solved this issue by designing and implementing a reliable data retrieval process for Creative Commons data. They transformed the unstable process into one that is algorithmic, reliable, documented, and interpretable, while producing more exhibitory data visualizations that can be generated readily on demand.
In the future, the student researchers hope that automation will make this data extraction method more widely available and improve its user experience.
In addition to this poster presentation, the team published a set of blog posts detailing their weekly progress.
Week 0: What is Quantifying the Commons?
Week 1: Starting off with Google Custom Search
Week 2: Google Search off Code
Week 6: In the Data Science Life Cycle, we Ask: What do you need to discover?
Week 7: Visualize the Commons: Engineering Data
Week 7: Visualize the Commons: Exhibiting Data
Week 8: Modeling the Commons: Guide to Machine Learning
Week 8: Modeling the Commons: Spend the first four sharpening the axe
Week 8: Modeling the Commons: Text to Number
Week 8: Modeling the Commons: Training the Models
Week 9: Delivering the Commons: A Summary of Side Notes
Week 10: Quantifying the Commons
Congratulations to Dun-Ming and Shuran!