Fall 2022 Data Science Insights Award Winner - Quantifying the Commons

Students: Dun-Ming Huang and Shuran Yang

The Data Science Insights Award recognizes the team with the best data science insights. One of the overarching goals of data science is to interpret the results of statistical and machine learning models and to implement solutions as necessary. Here, we recognize a team whose insights are firmly grounded in their methods. We define an insight as anything that improves the understanding of a trend, system, or phenomenon and of how it arises. The award is therefore based not only on the results of the project but also on how those results are contextualized within the field.

Dun-Ming Huang and Shuran Yang collaborated with the Creative Commons team on Quantifying the Commons, a project whose main objective was to quantify the size and diversity of the Commons.

Previously, measuring the Commons depended on unreliable data retrieval methods with poor performance and efficiency. The student researchers solved this issue by designing and implementing a reliable data retrieval process for Creative Commons data. They transformed the unstable process into one that is algorithmic, reliable, documented, and interpretable, and they produced clearer data visualizations that can be generated readily on demand.

Going forward, the student researchers hope that automation will make this data extraction method more widely available and easier to use.

[Image: Quantifying the Commons poster]

In addition to this poster presentation, the team published a series of blog posts detailing their weekly progress:

Week 0: What is Quantifying the Commons?

Week 1: Starting off with Google Custom Search

Week 2: Google Search off Code

Week 3–5: Boilerplating

Week 6: In the Data Science Life Cycle, we Ask: What do you need to discover?

Week 7: Visualize the Commons: Engineering Data

Week 7: Visualize the Commons: Exhibiting Data

Week 8: Modeling the Commons: Guide to Machine Learning

Week 8: Modeling the Commons: Spend the first four sharpening the axe

Week 8: Modeling the Commons: Text to Number

Week 8: Modeling the Commons: Training the Models

Week 9: Delivering the Commons: A Summary of Side Notes

Week 10: Quantifying the Commons

Congratulations to Dun-Ming and Shuran!