Data By The Bay

We have seats in each training on the day, you can come by Galvanize at 8:30am and register onsite! The trainings start at 9am, and you may join when it is in progress.

The training schedule is as follows:

8:30am light breakfast at Galvanize (44 Tehama St)
9am-12noon training
12noon-1pm lunch
1-4pm training

We have two fundamental trainings from world-class experts, enabling you to build data pipelines similar to those underpinning the companies presenting at Data By the Bay. All trainings are taught in parallel on May 15, 2016, before Data By the Bay, at Galvanize (the conference venue). In order to register, get a single training pass and redeem it for one of the two trainings. They run in parallel.

Gabor Melli is the Chief Scientist at OpenGov, author of KDD-award winning paper about production NLP pipelines, and co-organizer of the KDD 2016. He returns with his highly reviewed Detailed Introduction to NLP.

Join Data Pipelines By the Bay on May 15th, 2016 for a unique day of Agile Data Science and End-to-End Data Pipeline Training.

We'll have dozens of engineers building, in one day, a complete analytics backend with:

Kafka message bus
Spark and Spark Streaming
Cassandra for persistence
Spark Notebook by Data Fellas for data analysis and Spark UI