100 Years of Horror Films: An Analysis Using Databricks SQL
When it comes to the history of film, perhaps no genre says more about us as humans than horror, which taps into our biggest phobias and uncertainties about the world. With such a huge range – from...
View ArticleMoneyball 2.0: Real-time Decision Making With MLB’s Statcast Data
The Oakland Athletics baseball team in 2002 used data analysis and quantitative modeling to identify undervalued players and create a competitive lineup on a limited budget. The book Moneyball, written...
View ArticleNow Generally Available: Simplify Data and Machine Learning Pipelines With...
We are excited to announce the general availability of Jobs orchestration, a new capability that lets Databricks customers easily build data and machine learning pipelines consisting of multiple,...
View ArticleDatabricks Sets Official Data Warehousing Performance Record
Today, we are proud to announce that Databricks SQL has set a new world record in 100TB TPC-DS, the gold standard performance benchmark for data warehousing. Databricks SQL outperformed the previous...
View ArticleTurning 2 Trillion Data Points of Traffic Intelligence into Critical Business...
This is a guest authored post by Stephanie Mak, Senior Data Engineer, formerly at Intelematics. This blog post offers my experience of contributing to the open source community with Bricklayer,...
View ArticleBuilding the Next Generation Visualization Tools at Databricks
This post is a part of our blog series on our frontend work. You can see the previous one on “Simplifying Data + AI, One Line of TypeScript at a Time.” After years of working on data visualization...
View ArticleSummer 2021 Databricks Internship – Their Work and Their Impact!
With COVID precautions still in place, the 2021 Databricks Software Engineering Summer internship was conducted virtually with members of the intern class joining us from their home offices located...
View ArticleEliminating the DeWitt Clause for Database Benchmarking
At Databricks, we often use the phrase “the future is open” to refer to technology; it reflects our belief that open data architecture will win out and subsume proprietary ones. “Open” isn’t just about...
View ArticleAnnouncing Databricks Engineering Fellowship
We are excited to announce a new program called Databricks Engineering Fellowship to recognize new graduates with exceptional academic achievements or extracurricular impact, in the field of computer...
View ArticleWhat to Expect at Data + AI World Tour
This year, we’re doing things a bit differently. In this still very virtual world, we wanted to find a way to bring the energy of Data + AI Summit and the power of lakehouse to a whole new audience....
View Article10 Powerful Features to Simplify Semi-structured Data Management in the...
Ingesting and querying JSON with semi-structured data can be tedious and time-consuming, but Auto Loader and Delta Lake make it easy. JSON data is very flexible, which makes it powerful, but also...
View ArticleWhy Scale Matters in Modern Financial Compliance
Let’s talk regulation. While not the sexiest topic for banks to deal with, working with regulations and compliance are critical to financial institutions’ success. On average 10% of bank revenue is...
View ArticleSnowflake Claims Similar Price/Performance to Databricks, but Not So Fast!
On Nov 2, 2021, we announced that we set the official world record for the fastest data warehouse with our Databricks SQL lakehouse platform. These results were audited and reported by the official...
View ArticleEvolution of the SQL language at Databricks: ANSI standard by default and...
Today, we are excited to announce that Databricks SQL will use the ANSI standard SQL dialect by default. This follows the announcement earlier this month about Databricks SQL’s record-setting...
View ArticleDatabricks’ Open Source Genomics Toolkit Outperforms Leading Tools
Genomic technologies are driving the creation of new therapeutics, from RNA vaccines to gene editing and diagnostics. Progress in these areas motivated us to build Glow, an open-source toolkit for...
View ArticleAccenture and Databricks Lakehouse Accelerate Digital Transformation
This is a collaborative post from Accenture and Databricks. We thank Matt Arellano, Managing Director, Global Data & AI Ecosystem Lead — Accenture, for his contributions. To keep pace with the...
View ArticleNow Generally Available: Introducing Databricks Partner Connect to Discover...
Databricks is thrilled to announce Partner Connect, a one-stop portal for customers to quickly discover a broad set of validated data, analytics, and AI tools and easily integrate them with their...
View ArticleBuild Your Business on Databricks With Partner Connect
At Databricks we believe that to create the ultimate customer experience, we must leverage the work of more than just our employees and create a platform others can extend. To see the importance of...
View ArticleRay on Databricks
Ray is an open-source project first developed at RISELab that makes it simple to scale any compute-intensive Python workload. With a rich set of libraries and integrations built on a flexible...
View ArticleBuilding Analytics on the Lakehouse Using Tableau With Databricks Partner...
This is a guest authored post by Madeleine Corneli, Sr. Product Manager, Tableau On November 18, Databricks announced Partner Connect, an ecosystem of pre-integrated partners that allows customers...
View Article