LEARN FROM THE BEST TRAINERS IN THE CLOUD

  • 5
    Days
  • 8
    Trainers
  • 250+
    Attendees
ML WORKSHOPS FOR ENGINEERS

The workshops from the list below are the complimentary part of full tickets.

The recordings of most workshops will be shared with full ticket holders after the conference.

Workshops will be run remotely via Zoom.

Dmitry Soshnikov

Introduction to Machine Learning on the Cloud

This workshop will be both a gentle introduction to Machine Learning, and a practical exercise of using the cloud to train simple and not-so-simple machine learning models. We will start with using Automatic ML to train the model to predict survival on Titanic, and then move to more complex machine learning tasks such as hyperparameter optimization and scheduling series of experiments on the compute cluster. Finally, I will show how Azure Machine Learning can be used to generate artificial paintings using Generative Adversarial Networks, and how to train language question-answering model on COVID papers to answer COVID-related questions.

Table of contents
  • - Introduction to AI in the Microsoft Azure Cloud
  • - Introduction to Azure Machine Learning
  • - Training Titanic predictive model using Automatic ML
  • - Using Jupyter Notebooks in Azure ML
  • - Using Visual Studio Code to schedule and track experiments
  • - Hyperparameter optimization and HyperDrive
  • - Training GANs on Azure ML to produce Artificial Art
  • - Using DeepPavlov NLP to answer COVID-related questions on CORD dataset
Prerequisites

To actively take part in this workshop, you will need Microsoft Azure Account. In order not to spend time during the workshop, please register free Azure account in advance if you do not already have one (students can register student account).

Also, you may want to install Visual Studio Code, and Azure Account and Azure ML Extensions (those are required just for one exercise).

Workshop level

Beginners and ML practitioners.

Workshop schedule & location

Nov 2, 18:00-20:00 CET. The workshop will be recorded.

Mikhail Burtsev

AI Assistant with DeepPavlov

Simple chatbots can be built with a number of simple rules, but their UX will be very narrow and limited. Building complex AI assistants, in contrast, requires a lot of effort. To make your users happy, you have to master many things at once: rules, intents, finite state machines, chit-chat tech, dialog flow management, and many other things. It's a job for an entire team. Fortunately, you don't have to hire it. Instead, you can rely upon your trusty software from DeepPavlov.ai. With it, you can offload the complexity of managing the entire infrastructure to DeepPavlov Agent – an open-source Conversational AI orchestrator. Next, you can pick state-of-the-art NLP components from DeepPavlov Library to decode human emotions and understand their intents. Finally, you can pick some of the available skills from DeepPavlov Dream to avoid reinventing the wheel like chit-chat.

In this workshop, you'll be guided through the entire process, from an overview of the platform to building a sample AI assistant that you'll be able to re-use later in your own work.

By the end of this workshop you'll be able to use the provided components from the DeepPavlov family of projects to build simple AI assistant:

  • work with existing NLP components from DeepPavlov Library like sentiment and emotion classifiers
  • build simple goal-oriented skill with DeepPavlov Go-Bot
  • combine them together with the DeepPavlov Agent into a simple AI assistant
  • augment your simple AI assistant with some of the DeepPavlov Dream pre-built skills
Workshop level

Intermediate or Advanced.

Workshop schedule & location

Nov 3, 18:00-21:00 CET. The workshop will be recorded.

Argyris Argyrou & Theodors Palamas

From Shazam to Today's Audio Fingerprinting Algorithms, the Theory, Best Practices, and Open Source Libraries to Experiment with

An acoustic fingerprint is a condensed digital summary, a fingerprint, deterministically generated from an audio signal, that can be used to identify an audio sample or quickly locate similar items in an audio database. Using open-source python libraries, we will describe the process of creating an audio fingerprint from a media file, how to store it, and search for matches.

Table of contents
  • - 1st part. An introduction to audio fingerprinting–one of the first algorithms developed in the industry and the most successful commercially until today was developed by researchers from Shazam. Their solution is to identify the strongest peaks in the spectrogram and to store the relative signatures of these peaks.
  • - 2nd part. Shazam allows you to send a recording made on your phone of nearly any song–not covers, and it will tell you the song’s name, and other data about that song. We will describe the process of creating an audio fingerprint from a media file, how to store it, and search for matches.
Workshop schedule & location

Nov 4, 17:00-18:30 CET. The workshop will be recorded.

Jason Mayes

Hands on with TensorFlow.js

Come check out our workshop which will walk you through 3 common journeys when using TensorFlow.js. We will start with demonstrating how to use one of our pre-made models - super easy to use JS classes to get you working with ML fast. We will then look into how to retrain one of these models in minutes using in browser transfer learning via Teachable Machine and how that can be then used on your own custom website, and finally end with a hello world of writing your own model code from scratch to make a simple linear regression to predict fictional house prices based on their square footage.

Prerequisites

Workshop requires some working knowledge of using HTML / CSS / JS to get the most out of it. No Machine Learning background is assumed.

Workshop level

Great for beginners or researchers looking to get the reach and scale of web technologies so more people can use their models. No ML knowledge is assumed.

Workshop schedule & location

Nov 4, 18:00-21:00 CET. The workshop will be recorded.

Bernhard Suhm & Louvere Walker-Hannon

Applied AI with AutoML and Interpretability in MATLAB

Are you new to Machine and Deep Learning and want to learn how to apply these techniques in your work? Building good machine learning models is difficult and time consuming – get empowered to apply AI to your domain with AutoML and tools that don’t require extensive coding experience!

Table of contents
  • - Part I (first hour): Seminar on how Automated Machine Learning (AutoML) can help practitioners and engineers use AI in their domains even though they lack experience with machine learning or are concerned about the black box nature of AI models. AutoML simplifies model development to a few steps, and interpretability can draw back the curtain from complex black box models.
  • - Part II (2 hours): Hands-on workshop where you work in our MATLAB Online platform and build machine learning models using interactive tools. You will apply machine learning to the problem of classifying human activities based on accelerometer sensors, get familiar with the basics of deep learning, and apply transfer learning to an image processing task.
Prerequisites

No machine learning background is assumed. No prior experience with MATLAB is expected, though casual knowledge of a high-level language will be helpful.

To actively take part and access MATLAB Online, you will need a MathWorks account. If you do not already have one, please register by following this link.

Workshop level

Great for beginners who need to familiarize themselves with the basics, but also researchers and engineers eager to learn about how advanced methods like AutoML and model interpretability will help them apply AI to their domains.

Workshop schedule & location

Nov 9, 18:00-21:00 CET. The workshop will be recorded.

Alyona Galyeva

The Hitchhiker's Guide to the Machine Learning Engineering Galaxy

Are you a Software Engineer who got tasked to deploy a machine learning or deep learning model for the first time in your life? Are you wondering what steps to take and how AI-powered software is different from traditional software? Then it is the right workshop to attend.

The internet offers thousands of articles and free of charge courses, showing how it is easy to train and deploy a simple AI model. At the same time in reality it is difficult to integrate a real model into the current infrastructure, debug, test, deploy, and monitor it properly. In this workshop, I will guide you through this process by sharing tips, tricks, and favorite open source tools that will make your life much easier. So, at the end of the workshop, you will know where to start your deployment journey, what tools to use, and what questions to ask.

Table of contents
  • - AI-powered software versus traditional software
  • - MLOPS and AI pipeline in a nutshell
  • - ML Platform: what is this?
  • - Hands-on with AI pipelines
Prerequisites

To get the most out of the workshop working knowledge of Python is required. No Machine Learning background is assumed.

Workshop level

Great for software engineers who got tasked with AI model deployment for the first time. No ML knowledge is assumed.

Workshop schedule & location

Nov 10, 18:00-20:00 CET. The workshop will be recorded.

Olga Megorskaya & Alexey Drutsa & Dmitry Ustalov

Crowdsourcing Practice for Efficient Data Labeling

In this tutorial, we present a portion of unique industry experience in efficient data labeling via crowdsourcing shared by both leading researchers and engineers from Yandex.

We will make an introduction to data labeling via public crowdsourcing marketplaces and will present the key components of efficient label collection. This will be followed by a practice session, where participants will choose one of the real label collection tasks, experiment with selecting settings for the labeling process, and launch their label collection project on one of the largest crowdsourcing marketplaces. The projects will be run on real crowds within the tutorial session. While the crowd performers are annotating the project set up by the attendees, we will present the major theoretical results in efficient aggregation, incremental relabeling, and dynamic pricing. We will also discuss the strengths and weaknesses as well as applicability to real-world tasks, summarizing our five year-long research and industrial expertise in crowdsourcing. Finally, participants will receive a feedback about their projects and practical advice on how to make them more efficient.

Workshop level

Beginners, advanced specialists, and researchers are invited to learn how to collect high quality labeled data and do it efficiently.

Workshop schedule & location

Nov 11, 18:00-21:00 CET. The workshop will be recorded.

Dmitry Soshnikov
Cloud Developer Advocate @ Microsoft, Russia

Dmitry is a Microsoft veteran, working for more than 13 years. He started as a Technical Evangelist, and in this role presented on numerous conferences, including twice being on stage with Steve Ballmer. He then worked for 2 years as Senior Software Engineer, helping big European companies to start pilot digital transformation projects based on AI and ML. As Cloud Developer Advocate, Dmitry focuses on creating educational content and working with academic and research institutions. He is also an Associate Professor at MIPT, HSE and MAI in Moscow, a big fan of functional programming and F#, and a maintainer/primary developer of mPyPl library. In his spare time, Dmitry explores Science Art and Technological Magic, as well as performs Chinese tea ceremonies. He can be reached at soshnikov.com.

Mikhail Burtsev
November 6
Full Info
Mikhail Burtsev
DeepPavlov.ai, Russia

Mikhail Burtsev is a head of Neural Networks and Deep Learning Laboratory at Moscow Institute of Physics and Technology. He is also a founder and leader of open-source conversational AI framework DeepPavlov. Mikhail had proposed and co-organize a series of academic Conversational AI Challenges (including NIPS 2017, NeurIPS 2018, EMNLP 2020).

His research interests are in the fields of Natural Language Processing, Machine Learning, Artificial Intelligence and Complex Systems. Mikhail Burtsev has published more than 20 technical papers including – Nature, Artificial Life, Lecture Notes in Computer Science series, and other peer-reviewed venues.

Argyris Argyrou
Orfium.com, Cyprus

Argyris is the SVP of Data at Orfium.com, and a PhD candidate in Cyprus University of Technology. His research interests are NLP and Machine learning. He is the founder and lead researcher at summarly.com, and co-organizer of PyData Cyprus.

Theodoros Palamas
Orfium.com, USA

Theodoros is the Lead ML Data Scientist in Audio Recognition at Orfium.com. His research and work focuses both on deep learning and algorithmic methods to solve audio-related problems such as exact audio recognition, cover song recognition, denoising and audio classification. The process includes everything from feature design and extraction to model training and evaluation.

Jason Mayes
November 5
Full Info
Jason Mayes
Google, USA

Jason is a Senior Developer Advocate for TensorFlow.js at Google.

Jason combines his knowledge of the technical and creative worlds to solve complex, strategic / technical challenges for Google's largest customers and internal teams. Developing innovative world firsts utilizing the latest technologies and hardware is a key component of his role to rapidly prototype new ideas and consult on project solutions globally.

With a background in Computer Science at the University of Bristol, England, where he specialized in reality mining and invisible computing, Jason has been a "hybrid engineer" for over 15 years. Combining his passion for several areas including both front and back end web programming, but also design and user experience, he has worked in many sizes of companies from startups (including founding his own) to Google.

Bernhard Suhm
MathWorks, USA

Bernhard Suhm is the product manager for Machine Learning at MathWorks. He works closely with customer facing and development teams to address customer needs and market trends in our machine learning related products, primarily the Statistics and Machine Learning toolbox. Prior to joining MathWorks Bernhard led analyst teams and developed methods applying analytics to optimizing the delivery of customer service in call centers. He also held positions at a usability consulting company and Carnegie Mellon University. He received a PhD in Computer Science specializing in speech user interfaces from Karlsruhe University in Germany.

Louvere Walker-Hannon
MathWorks, USA

Louvere Walker-Hannon is a MathWorks Senior Application Engineer, who provides direction and recommendations on technical workflows for various applications. Specifically, she assists with providing guidance on the following topics image processing, computer vision, machine learning, deep learning, geospatial analysis, and data analytics when discussing technical workflows. She has a bachelor’s degree in Biomedical Engineering and a master’s degree in Geographic Information Technology with a specialization in Remote Sensing. Louvere has worked in three different engineering roles throughout her 20 year career while at MathWorks and is a STEM advocate.

Alyona Galyeva
Workshop
Full Info
Alyona Galyeva
LINKIT, Netherlands

I encourage others to see different perspectives and constructively break the rules. Observe - Optimize - Learn - Repeat is my work and life motto. Next to it, I found my joy in building and optimizing end-to-end Machine Learning Systems.

  • Principal Data Solutions Engineer @ LINKIT
  • Organizer (volunteer) @ PyLadies Amsterdam
  • WaiACCELERATE Tech Mentor (volunteer) @ Women in AI
Olga Megorskaya
Yandex.Toloka, Russia

Olga is a head of Toloka crowdsourcing platform. She's been responsible for providing human-labeled data for all AI projects at Yandex and implementing crowd-based human-in-the-loop solutions in such areas as software testing, customer support, product localization, generation of content, etc. Olga helped Yandex to grow the number of crowd performers involved in data labeling from several dozens in 2009 up to 6.5M in 2020 and is now developing Toloka as a global infrostructure for data labelling available for all ML specialists. She graduated from the Saint Petersburg State University as a specialist in Mathematical Methods and Modeling in Economics. Also, she was a co-author of research papers and tutorials on efficient crowdsourcing and quality control at SIGIR, CVPR, KDD, WSDM, and SIGMOD.

Alexey Drutsa
Yandex.Toloka, Russia

Alexey is responsible for data-driven decisions and the crowd platform ecosystem. His research interests are focused on Machine Learning, Data Analysis, Auction Theory; his research is published at ICML, NeurIPS, WSDM, WWW, KDD, SIGIR, CIKM, and TWEB. Alexey was a co-author of three tutorials on practical A/B testing (at KDD2018, WWW2018, andSIGIR 2019) and four hands-on tutorials on efficient crowdsourcing (at KDD 2019, WSDM 2020, SIGMOD 2020, and CVPR2020). He served as a senior PC member at WWW2019 and as a PC member at several NeurIPS, ICML, KDD, WSDM, CIKM, and WWW conferences; he was also a session chair at WWW2017. He graduated from Lomonosov Moscow State University (Faculty of Mechanics and Mathematics) in 2008 and received his PhD in Computational Mathematics from the same university in 2011.

Dmitry Ustalov
Yandex.Toloka, Russia

Dr. Dmitry Ustalov is responsible for crowd activity analysis, investigation of suspected fraud, and product metrics at Yandex.Toloka. His research interests are focused on Crowdsourcing and Natural Language Processing. His studies are published at such premier venues as COLI, ACL, EACL, and EMNLP; he serves as a reviewer for COLI, SWJ, ACL, EMNLP, COLING, ISWC, *SEM, EKAW, and other publications. Dmitry has organized the Crowd Science workshop at NeurIPS 2020 as well as the hands-on crowdsourcing tutorials at SIGMOD/PODS '20 and WSDM '20. Before joining Yandex, he received his Kandidat Nauk (PhD) degree in 2018 from the South Ural State University (Russia) and was a post-doctoral research fellow at the University of Mannheim (Germany).