"Apache Spark is a powerful, scalable real-time data analytics engine that is fast becoming the de facto hub for data science and big data. However, in parallel, GPU clusters are fast becoming the default way to quickly develop and train deep learning models. As data science teams and data savvy companies mature, they will need to invest in both platforms if they intend to leverage both big data and artificial intelligence for competitive advantage.
This session will cover:
- How to leverage Spark and TensorFlow for hyperparameter tuning and for deploying trained models
- DeepLearning4J, CaffeOnSpark, IBM's SystemML and Intel's BigDL
- Sidecar GPU cluster architecture and Spark-GPU data reading patterns
- The pros, cons and performance characteristics of various approaches
You'll leave the session better informed about the available architectures for Spark and deep learning, and Spark with and without GPUs for deep learning. You'll also learn about the pros and cons of deep learning software frameworks for various use cases, and discover a practical, applied methodology and technical examples for tackling big data deep learning.
Session hashtag: #SFds14"
About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.
Read more here: [ Ссылка ]
Connect with us:
Website: [ Ссылка ]
Facebook: [ Ссылка ]
Twitter: [ Ссылка ]
LinkedIn: [ Ссылка ]
Instagram: [ Ссылка ] Databricks is proud to announce that Gartner has named us a Leader in both the 2021 Magic Quadrant for Cloud Database Management Systems and the 2021 Magic Quadrant for Data Science and Machine Learning Platforms. Download the reports here. [ Ссылка ]
![](https://i.ytimg.com/vi/aAhAJFk1OVc/maxresdefault.jpg)