New Features to Accelerate the Path to Production With the Next Generation...
Today, at the Data + AI Summit Europe 2020, we shared some exciting updates on the next generation Data Science Workspace – a collaborative environment for modern data teams – originally unveiled at...
View ArticleMLflow Model Registry on Databricks Simplifies MLOps With CI/CD Features
MLflow helps organizations manage the ML lifecycle through the ability to track experiment metrics, parameters, and artifacts, as well as deploy models to batch or real-time serving systems. The MLflow...
View ArticleHow Scribd Uses Delta Lake to Enable the World’s Largest Digital Library
Scribd uses Delta Lake to enable the world’s largest digital library. Watch this discussion with QP Hou, Senior Engineer at Scribd and an Airflow committer, and R Tyler Croy, Director of Platform...
View ArticleDelta vs. Lambda: Why Simplicity Trumps Complexity for Data Pipelines
“Everything should be as simple as it can be, but not simpler” – Albert Einstein Generally, a simple data architecture is preferable to a complex one. Code complexity increases points of failure,...
View ArticleEnforcing Column-level Encryption and Avoiding Data Duplication With PII
This is a guest post by Keyuri Shah, lead software engineer, and Fred Kimball, software engineer, Northwestern Mutual. Protecting PII (personally identifiable information) is very important as the...
View ArticleACID Transactions on Data Lakes
As part of our Data + AI Online Meetup, we’ve explored topics ranging from genomics (with guests from Regeneron) to machine learning pipelines and GPU-accelerated ML to Tableau performance...
View ArticleAzure Databricks Achieves FedRAMP High Authorization on Microsoft Azure...
We are excited to announce that Azure Databricks is now Federal Risk and Authorization Management Program (FedRAMP) authorized at the High Impact level, enabling new data and AI use cases across public...
View ArticleEstablishing Your Career Path: Lessons Brought to You by Databricks’ Women in...
Women in Sales (WIS) is a global employee networking group (ERG) at Databricks dedicated to helping women accelerate their careers in sales. On October 13th, 2020, WIS hosted Heather Akuiyibo, VP of...
View ArticleSimplify Access to Delta Lake Tables on Databricks From Serverless Amazon...
This post is a collaboration between Databricks and Amazon Web Services (AWS), with contributions by Naseer Ahmed, senior partner architect, Databricks, and guest author Igor Alekseev, partner...
View ArticleAzure Databricks Now Generally Available in Azure Government
We are excited to announce that Azure Databricks is now generally available (GA) in Microsoft’s Azure Government (MAG) region, enabling new data and AI use cases for federal agencies, state and local...
View ArticleThe Analytics Evolution With Azure Databricks, Azure Synapse and Power BI
Let’s face it, the landscape of different analytics services and products is complicated and constantly evolving. The Databricks and Microsoft partnership that created Azure Databricks began 4 years...
View ArticleHow Retina Uses Databricks Container Services to Improve Efficiency and...
This is a guest community post authored by Brad Ito, CTO Retina.ai, with contributions by Databricks Customer Success Engineer Vini Jaiswal Retina is the customer intelligence partner that empowers...
View ArticleSee Databricks at re:Invent and Demystify Your Data
Databricks, founded by the original creators of Apache Spark™ and Delta Lake, is thrilled to be a Platinum sponsor at AWS re:Invent 2020, where you can see how we simplify data engineering, analytics...
View ArticleAzure Databricks Now Generally Available in the Azure China Region
We are excited to announce that Azure Databricks is now generally available in Microsoft’s Azure China region, enabling new data and AI use cases with fast, reliable and scalable data processing,...
View ArticleDatabricks Is Named a Visionary in the 2020 Gartner Magic Quadrant for Cloud...
Last week, Gartner published the Magic Quadrant (MQ) for Cloud Database Management Systems, where Databricks was recognized as a Visionary in the market.1 This was the first time Databricks was...
View ArticleLearn How Disney+ Built Their Streaming Data Analytics Platform With...
Martin Zapletal, Software Engineering Director at Disney+, is presenting at re:Invent 2020 with the session How Disney+ uses fast data ubiquity to improve the customer experience (must be registered to...
View ArticlePython Autocomplete Improvements for Databricks Notebooks
At Databricks, we strive to provide a world-class development experience for data scientists and engineers, and new features are constantly getting added to our notebooks to improve our users’...
View ArticleHandling Late Arriving Dimensions Using a Reconciliation Pattern
This is a guest community post authored by Chaitanya Chandurkar, Senior Software Engineer in the Analytics and Reporting team at McGraw Hill Education. Special thanks to MHE Analytics team members...
View ArticleTop Questions from Our Lakehouse Event
We recently held a virtual event, featuring CEO Ali Ghodsi, that showcased the vision of Lakehouse architecture and how Databricks helps customers make it a reality. Lakehouse is a data platform...
View ArticleA Step-by-step Guide for Debugging Memory Leaks in Spark Applications
This is a guest authored post by Shivansh Srivastava, software engineer, Disney Streaming Services. It was originally published on Medium.com Just a bit of context We at Disney Streaming Services use...
View Article