♫musicjinni

Pydata Berlin Meetup February 2021: Bulk Labelling

video thumbnail
Let's say you want to start working on a chatbot. Then the likely issue won't be that you don't have enough deep learning models that you can pip install. The real problem is probably that you don't have proper training data to start with. It's a precious moment in time to take serious, but what tools can we use for this?

In this talk I would like to share some techniques/tools that I've been working on for this use-case. The goal won't be to have a gold-standard right away. Instead the goal is to bootstrap as quickly as possible. During the talk I'll do some live-coding and I'll highlight some tools that I've open sourced that might help you bulk-label.



www.pydata.org

PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome!
00:10 Help us add time stamps or captions to this video! See the description for details.

Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVideoTimestamps

Ankit Mahato- Supercharge Scientific Computing in Python with Numba | PyData Global 2020

Dante Gama Dessavre: Open Source is Better Together- GPU Python Libraries Unite | PyData LA 2019

Miroslav Šedivý - Python Lets go home quickly| PyData Global 2020

James Powell: So you want to be a Python expert? | PyData Seattle 2017

James Munro - Leveraging python and open-source for data-science on the buy-side |PyData Global2020

Adrien Treuille: Turn Python Scripts into Beautiful ML Tools | PyData LA 2019

Tailai Wen: ADTK: An open-source Python toolkit for anomaly detection in... | PyData Austin 2019

Gajendra Deshpande- Inventing Curriculum using Python and spaCy | PyData Global 2020

Moussa Taifi: Clean Machine Learning Code: Practical Software Engineering... | PyData New York 2019

Ekhtiar Syed: Exploratory Data Analysis (EDA) and Visualization Techniques.. | PyData Eindhoven 2019

Improving your Python skills with CodinGame.com | John Stinson | PyData Pune Meetup | July 2020

Super Search with Python and OpenSearch - Laysa Uchoa

NumFOCUS End-of-year Telethon: Hosted by James Powell, Don't Use This Code

Matthew Seal: Data and ETL with Notebooks in Papermill | PyData LA 2019

Travis E Oliphant: Extending Python Into the Future | PyData Austin 2019

Roman Yurchak- Pyodide Scientific Python Compiled to Webassembly Optimized| PyData Global 2020

Dash: data exploration web apps in pure Python - Chelsea Douglas

Dmitry Petrov: Machine Learning Models Versioning Using Open Source Tools | PyData LA 2019

Effective Pandas I Matt Harrison I PyData Salt Lake City Meetup

James Powell: I Just Inherited 50,000 Lines of Code! What Now? — A Practical Guide | PyData LA 2018

James Powell: What You Got Is What You Got | PyData LA 2019

Hareem Naveed: Write the Docs! | PyData LA 2019

Solving large scale inverse problems in Python with PyLops - M. Ravasi, I. Vasconcelos and D. Vargas

… - James Powell

Chiin Rui Tan- Ipywidgets for Education! | PyData Global 2020

Ana Castro Salazar, Pasha Stetsenko: Intro to Data Analysis with Python Data-table | PyData LA 2019

Satej Khedekar: A Python application to flag outliers in very high... | PyData Eindhoven 2019

Bertjan Broeksema & Huib Keemink - Exploring Railway Oriented Programming in Python | PyData Fest

Joseph Kearney, Shahid Barkat: A Python Package for Grappling with Missing Data | PyData LA 2019

Using Serverless, Python, R and Machine learning to save a country | PyData Athens

Disclaimer DMCA