Why Cloud Centric Data Lake is the future of EDW
In this first of two blogs, we want to talk about WHY an organization might want to look at a lakehouse architecture (based on Delta Lake) for their data analytics pipelines instead of the standard...
Learn the Comcast Architecture for Enterprise Metadata and Security
Comcast will present a live session on their architecture for metadata and security at our upcoming Databricks AWS Cloud Data Lake DevDay. The event includes a hands-on lab with Databricks notebooks...
A Guide to MLflow Talks at Data + AI Summit Europe 2020
In the two years since its release, MLflow has seen rapid adoption among enterprises and the data science community. With over 2M downloads, 260 contributors, and 100+ organizations...
Improving the Spark Exclusion Mechanism in Databricks
Ed Note: This article contains references to the term blacklist, a term that the Spark community is actively working to remove from Spark. The feature name will be changed in the upcoming Spark 3.1...
Healthcare and Life Sciences Agenda for Data + AI Summit Europe 2020
Looking for the best Healthcare and Life Sciences events and sessions at Data + AI Summit Europe 2020 (Nov 17-19)? Below are some highlights. You can also find all Healthcare-related sessions,...
Retail and Consumer Goods Agenda for Data + AI Summit Europe 2020
Looking for the best Retail & CPG events and sessions at Data + AI Summit Europe 2020 (Nov 17-19)? Below are some highlights. You can also find all Retail-related sessions, including customer case...
Financial Services Agenda for Data + AI Summit Europe 2020
Looking for the best Financial Services events and sessions at Data + AI Summit Europe 2020 (Nov 17-19)? Below are some highlights. You can also find all Financial-related sessions, including customer...
Media and Entertainment Agenda for Data + AI Summit Europe 2020
Looking for the best Media and Entertainment (M&E) events and sessions at Data + AI Summit Europe 2020 (Nov 17-19)? Below are some highlights. You can also find all M&E-related sessions,...
Leveraging ESG Data to Operationalize Sustainability
The benefits of Environmental, Social and Governance (ESG) are well understood across the financial services industry. In our previous blog post, we demonstrated how asset managers can leverage data...
Analytics on the Data Lake With Tableau and the Lakehouse Architecture
Over the past two years we’ve seen a number of organizations moving their data work to the cloud. It simplifies access and scales to handle the biggest volumes. At Tableau, we’re all about customer...
Announcing the Launch of SQL Analytics
Today, we announced the new SQL Analytics service to provide Databricks customers with a first-class experience for performing BI and SQL workloads directly on the data lake. This launch brings to life...
Data Teams Unite! Countdown to Data + AI Summit Europe
Data + AI Summit 2020 Europe takes place virtually in just a few days, from 17-19 November – and it’s free to attend! Formerly known as Spark + AI Summit, Data + AI Summit will bring together thousands...
MLflow 1.12 Features Extended PyTorch Integration
MLflow 1.12 features include extended PyTorch integration, SHAP model explainability, autologging MLflow entities for supported model flavors, and a number of UI and document improvements. Now...
How to Evaluate Data Pipelines for Cost to Performance
Learn best practices for designing and evaluating cost-to-performance benchmarks from Germany’s #1 weather portal. While we certainly conduct several benchmarks, we know the best benchmark is your...
Fatal Force: Exploring Police Shootings With SQL Analytics
Introduction: Data has shown that police in the United States kill civilians at a rate far higher than police in other wealthy countries.[1] In 2019, law enforcement in the U.S. killed 33.5 civilians per...
How to Train XGBoost With Spark
XGBoost is currently one of the most popular machine learning libraries, and distributed training is increasingly required to accommodate the rapidly growing size of datasets. To utilize...
Key Sessions for AWS Customers at Data + AI Summit Europe 2020
Databricks and Summit Gold Sponsor AWS present on a wide variety of topics at this year’s premier data and AI event. Amazon Web Services (AWS) is sponsoring Data + AI Summit Europe 2020 and our work...
Key Sessions for Microsoft Azure Customers at Data + AI Summit Europe 2020
Databricks, diamond sponsor Microsoft and Azure Databricks customers to present keynotes and breakout sessions at Data + AI Summit Europe. Data + AI Summit Europe is the free virtual event for data...
Databricks Partner Executive Summit at Data + AI Summit 2020 Europe
This week’s Partner Executive Summit, held in concert with Data + AI Summit 2020 Europe, is a feature event for our 500+ partners globally, and we love to share how partners are critical to making a...
Databricks and Coursera Launch Data Science Specialization for Data Analysts
Earlier this year, Databricks made a massive investment in training by providing free self-paced courses to all of our customers. Databricks furthers this investment by partnering with Coursera to...