Digital signal processing (DSP) has been made easy with the help of many Python libraries, allowing engineers and researchers to quickly and effortlessly analyze audio, images, and video. However, scaling these algorithms and models to process millions of files has not been equally as seamless. At Spotify, we’re trying to address scaling DSP over our catalog of over 50 million songs. This talk will discuss the challenges we’ve encountered while building the infrastructure needed to support audio processing at scale.
Read MoreWe will explore an end to end example that combines batch and streaming aspects in one uniform Beam pipeline, and deploy it to a fully managed environment with Amazon Kinesis Data Analytics.
Read MoreWe will work through an end-to-end example of using the Python Beam SDK to build maintainable, reliable, and scalable production pipelines for deep learning.
Read MoreThis workshop is for participants who are getting started with Apache Beam. We will get hands-on writing Beam pipelines, as well as discuss the fundamentals of Beam programming model and SDKs.
Read MoreWe will implement a Google Cloud Dataflow pipeline through a series of labs where we will explore functionalities like auto-scaling, monitoring and optimization.
Read MoreLearn how to install Conda on your worker machine, personalize your workers’ environment and integrate with your Apache Beam pipeline.
Read MoreStep-by-step walkthrough of how to write custom TFX components with Apache Beam to customize your ML pipelines beyond the standard components and tailor the components for their ML pipelines.
Read More