Learn how to collect and annotate high-quality data for a computer vision model. Thought leaders in the Artificial Intelligence space such as Andrew Ng have been advocating for a shift from model-centric to data-centric AI. The idea behind this campaign is that AI models can be only marginally improved through tweaks in the algorithm but considerable change can only be achieved by using high-quality data. However, what does "high-quality data" mean and how do we go about ensuring the quality, diversity, and consistency of our dataset?
In this talk, we will discuss the practice of collecting and annotating data for your computer vision models and making sure the dataset you are using is representative and free of harmful biases.
About the Presenter:
Iva Gumnishka is the founder and CEO of Humans in the Loop, a professional data collection and annotation company focused on building high-quality datasets for computer vision applications. The company is a social enterprise and its mission is to provide dignified work opportunities to refugees and conflict-affected people through annotation projects. Iva holds a degree in Human Rights from Columbia University and she was named Forbes 30 under 30 in 2018.
For further tutorials on the fundamentals of machine learning, check out this exclusive playlist: [ Ссылка ]
Table of Contents:
0:00 – Introduction
3:28 – Why should we care about high-quality data for computer vision?
8:13 – Algorithmic methods for bias mitigation
9:17 – Model centric to data centric AI
11:06 – Take of EU
28:41 – Canonical large-scale dataset
31:47 – Self-supervised pretraining
36:46 – Image collection and labeling
38:03 – Data quality
40:03 – The future
48:09 – Questions
--
At Data Science Dojo, we believe data science is for everyone. Our data science trainings have been attended by more than 10,000 employees from over 2,500 companies globally, including many leaders in tech like Microsoft, Google, and Facebook. For more information please visit: [ Ссылка ]
💼 Learn to build LLM-powered apps in just 40 hours with our Large Language Models bootcamp: [ Ссылка ]
💼 Get started in the world of data with our top-rated data science bootcamp: [ Ссылка ]
💼 Master Python for data science, analytics, machine learning, and data engineering: [ Ссылка ]
💼 Explore, analyze, and visualize your data with Power BI desktop: [ Ссылка ]
--
Unleash your data science potential for FREE! Dive into our tutorials, events & courses today!
📚 Learn the essentials of data science and analytics with our data science tutorials: [ Ссылка ]
📚 Stay ahead of the curve with the latest data science content, subscribe to our newsletter now: [ Ссылка ]
📚 Connect with other data scientists and AI professionals at our community events: [ Ссылка ]
📚 Checkout our free data science courses: [ Ссылка ]
📚 Get your daily dose of data science with our trending blogs: [ Ссылка ]
--
📱 Social media links
Connect with us: [ Ссылка ]
Follow us: [ Ссылка ]
Keep up with us: [ Ссылка ]
Like us: [ Ссылка ]
Find us: [ Ссылка ]
--
Also, join our communities:
LinkedIn: [ Ссылка ]
Twitter: [ Ссылка ]
Facebook: [ Ссылка ]
Vimeo: [ Ссылка ]
Discord: [ Ссылка ]
_
Want to share your data science knowledge? Boost your profile and share your knowledge with our community: [ Ссылка ]
#Computervision #datascience #tech
Ещё видео!