Announcing Databricks Seattle R&D Site
Today, we are excited to announce the opening of our Seattle R&D site and our plan to hire hundreds of engineers in Seattle in the next several years. Our office location is in downtown Bellevue....
View ArticleHow DPG Delivers High-quality and Marketable Segments to Its Advertisers.
This is a guest authored post by Bart Del Piero, Data Scientist, DPG Media. At the start of a campaign, marketers and publishers will often have a hypothesis of who the target segment will be, but...
View ArticleTackle Unseen Quality, Operations and Safety Challenges With Lakehouse...
Globally, out-of-stocks cost retailers an estimated $1T in lost sales. An estimated 20% of these losses are due to phantom inventory, the misreporting of product units actually on-hand. Despite...
View ArticleThe Foundation of Your Lakehouse Starts With Delta Lake
It’s been an exciting last few years with the Delta Lake project. The release of Delta Lake 1.0 as announced by Michael Armbrust in the Data+AI Summit in May 2021 represents a great milestone for the...
View ArticleScala at Scale at Databricks
With hundreds of developers and millions of lines of code, Databricks is one of the largest Scala shops around. This post will be a broad tour of Scala at Databricks, from its inception to usage,...
View ArticleDeploying dbt on Databricks Just Got Even Simpler
At Databricks, nothing makes us happier than making our users more productive, which is why we are delighted to announce a native adapter for dbt. It’s now easier than ever to develop robust data...
View ArticleIntroducing Data Profiles in the Databricks Notebook
Before a data scientist can write a report on analytics or train a machine learning (ML) model, they need to understand the shape and content of their data. This exploratory data analysis is iterative,...
View ArticleIntroduction to Databricks and PySpark for SAS Developers
This is a collaborative post between Databricks and WiseWithData. We thank Founder and President Ian J. Ghent, Head of Pre-Sales Solutions R &D Bryan Chuinkam, and Head of Migration Solutions...
View ArticleAnnouncing CARTO’s Spatial Extension for Databricks — Powering Geospatial...
This is a collaborative post by Databricks and CARTO. We thank Javier de la Torre, Founder and Chief Strategy Officer at CARTO for his contributions. Today, CARTO is announcing the beta launch of...
View ArticleLog4j2 Vulnerability (CVE-2021-44228) Research and Assessment
This blog relates to an ongoing investigation. We will update it with any significant updates, including detection rules to help people investigate potential exposure due to CVE-2021-44228 both within...
View ArticleAnnouncing General Availability of Databricks SQL
Today, we are thrilled to announce that Databricks SQL is Generally Available (GA)! This follows the announcement earlier this month about Databrick SQL’s world record-setting performance for data...
View ArticleAre GPUs Really Expensive? Benchmarking GPUs for Inference on the Databricks...
It is no secret that GPUs are critical for artificial intelligence and deep learning applications since their highly-efficient architectures make them ideal for compute-intensive use cases. However,...
View ArticleDatabricks Named a Leader in 2021 Gartner® Magic Quadrant for Cloud Database...
Today, we are thrilled to announce that Databricks has been named a Leader in 2021 Gartner® Magic Quadrant for Cloud Database Management Systems. We believe this achievement makes Databricks the only...
View ArticleBuilding a Geospatial Lakehouse, Part 1
An open secret of geospatial data is that it contains priceless information on behavior, mobility, business activities, natural resources, points of interest and more. Geospatial data can turn into...
View ArticleEnabling Computer Vision Applications With the Data Lakehouse
The potential for computer vision applications to transform retail and manufacturing operations, as explored in the blog Tackle Unseen Quality, Operations and Safety Challenges with Lakehouse enabled...
View ArticleImplementing MLOps on Databricks using Databricks notebooks and Azure DevOps,...
This is the second part of a two-part series of blog posts that show an end-to-end MLOps framework on Databricks, which is based on Notebooks. In the first post, we presented a complete CI/CD framework...
View ArticleHow to Build Scalable Data and AI Industrial IoT Solutions in Manufacturing
This is a collaborative post between Bala Amavasai of Databricks and Tredence, a Databricks consulting partner. We thank Vamsi Krishna Bhupasamudram, Director – Industry Solution, and Ashwin...
View ArticleWhy We Invested in Labelbox: Streamline Unstructured Data Workflows in a...
Last month, Databricks announced the creation of Databricks Ventures, a strategic investment vehicle to foster the next generation of innovation and technology harnessing the power of data and AI. We...
View ArticleThe Lakehouse for Retail
Every morning, as people are just beginning to rise, the business of retail is already in full motion. Delivery trucks are beginning their routes to bring goods to stores and millions of homes....
View ArticleConfluent Streaming for Databricks: Build Scalable Real-time Applications on...
For many organizations, real-time data collection and data processing at scale can provide immense advantages for business and operational insights. The need for real-time data introduces technical...
View Article