Orchestrating Data Analytics with Databricks Workflows
For data-driven enterprises, data analysts play a crucial role in extracting insights from data and presenting them in a meaningful way. However, many...
How Edmunds builds a blueprint for generative AI
This blog post is in collaboration with Greg Rokita, AVP of Technology at Edmunds. Long envisioned as a key milestone in computing, we've...
Apache Spark 3 Apache DataSketches: New Sketch-Based Approximate Distinct...
Introduction: In this blog post, we'll explore a set of advanced SQL functions available within Apache Spark that leverage the HyperLogLog algorithm, enabling...
Solution Accelerator: LLMs for Manufacturing
Since the publication of the seminal paper on transformers by Vaswani et al. from Google, large language models (LLMs) have come to dominate...
Congratulations to Summer Hackathon Winners!
Earlier this year, Databricks launched Dolly 2.0: the world's first truly open instruction-tuned Large Language Model (LLM). To build off this excitement around...
The Data + AI Trifecta: People, Process, and Platform
Business leaders are all asking the same questions: How do we accelerate our company’s plan for data and AI? How can we take a...
Using Images and Metadata for Product Fuzzy Matching with Zingg
Product matching is an essential function in many retail and consumer goods organizations. Incoming products are compared to items in the existing product...
New Support for Conflict Resolution in Repos: Merge, Rebase and Pull
At Databricks, we are committed to simplifying the developer experience and are thrilled to unveil additional Git capabilities in Databricks Repos. Users can...
Making Spark Accessible: My Databricks Summer Internship
My summer internship on the PySpark team was a whirlwind of exciting events. The PySpark team develops the Python APIs of the open...
Governing cybersecurity data across multiple clouds and regions using Unity...
According to a 2023 report from Enterprise Search Group, 85% of organizations indicated they deploy applications on two or more IaaS providers, attesting...
easyJet bets on Databricks Lakehouse and Generative AI to be an Innovation...
This blog is authored by Ben Dias, Director of Data Science and Analytics, and Ioannis Mesionis, Lead Data Scientist at easyJet. Introduction to...
Deploy Private LLMs using Databricks Model Serving
We are excited to announce public preview of GPU and LLM optimization support for Databricks Model Serving! With this launch, you can deploy...
Announcing the Public Preview of Lakeview Dashboards!
We are excited to announce the public preview of the next generation of Databricks SQL dashboards, dubbed Lakeview dashboards. Available today, this new...
Ballard Power Systems RDU (Remote Diagnostics Unit) Visualization Platform...
This article represents a collaborative effort between Plotly, Ballard Power Systems, and Databricks. Fleets of buses worldwide run on hydrogen fuel cells made...
Cracking the Code: How Databricks is Reshaping Major League Baseball with...
Biomechanical data has emerged as a game-changing factor for Major League Baseball (MLB) teams, offering a competitive edge in enhancing player performance and...
Bringing Software Engineering Best Practices to Life Sciences R&D at Exai Bio
This blog was written in collaboration with Sukh Sekhon, Software Engineer, Cloud Infrastructure and Helen Li, Sr. Director of Engineering at Exai Bio...
Databricks Expands Brickbuilder Program to Include Lakehouse Accelerators
Today, we’re excited to announce Brickbuilder Accelerators, an expansion to the Brickbuilder Program that pairs the expertise of system integrator and consulting partners w...
Accelerate Your AI Journey with Pre-built Industry Solutions on Databricks...
Every organization is seeking to gain value from data—whether internally or externally from third-party data acquired from data marketplaces. Organizations across industries can b...
Crossing Bridges: Reporting on NYC taxi data with RStudio and Databricks
As data enthusiasts, we love uncovering stories in datasets. With Posit's RStudio Desktop and Databricks, you can analyze data with dplyr, create impressive...
Announcing Inference Tables: Simplified Monitoring and Diagnostics for AI models
Have you ever deployed an AI model, only to discover it's delivering unexpected results in a real-world setting? Monitoring models is as crucial...