Keynote: The state of Apache Beam
By

Keynote: The state of Apache Beam

In this talk we’ll provide an overview of the current and upcoming developments in the Apache Beam Runners. Based off that, we will look into new use cases and patterns. Finally, we want to brainstorm about new items for the roadmap, and encourage users and developers to share their ideas.

Read More
Mining scientific literature with Apache Beam
By

Mining scientific literature with Apache Beam

At BenchSci, we mine a subset of the world’s biological research papers (about 10 million of them) with the aim of extracting info that will accelerate future pharmaceutical research programs by enabling more reproducible experiments. While we have the luxury of processing this information in batch, it is not without its challenges. Information comes to us in a wide variety of data types and formats, from archives to individual documents, with little to no visibility into what the contents will be, or whether they will be consistent or not.

Read More
MLOps with TensorFlow Extended and Apache Beam
By

MLOps with TensorFlow Extended and Apache Beam

In this talk, Hannes is providing insights into using TensorFlow Extended (TFX) and Apache Beam for MLOps. He introduces how TFX is using Apache Beam for data pipeline tasks and for orchestration entire ML pipelines. The audience learned how to run ML production pipelines with Apache Beam, and therefore, free the data scientist’s time from maintaining production machine learning models. Hannes shows real-life examples for MLOps using TFX and Apache Beam.

Read More
Networking event
By

Networking event

We will host a networking event on a virtual space (Gather Town) where you will be able to interact with other members of the Apache Beam community from all over the world. Join at https://sg1.run/beam-summit

Read More