It’s Time to Re-evaluate Your Relationship With Hadoop
With companies forced to adapt to a remote, distributed workforce this past year, cloud adoption has accelerated at an unprecedented pace by +14% resulting in 2% or $13B above pre-pandemic forecasts...
View ArticleCreating Growth and Advancement Opportunities: Introducing the Women in Tech...
At Databricks, we recognize the importance of offering professional growth and advancement opportunities for all communities and are committed to fostering a work environment where every employee can...
View ArticleDatabricks Notebook Dark Theme
This became the most-requested feature in Databricks’ history, and now it’s here: a dark theme for the Databricks notebook! We’re excited for you to try it out. To turn it on, open a notebook and...
View ArticleTop Questions from Customers about Delta Lake
Top Questions from Customers about Delta Lake Last week, we hosted a virtual event highlighting Delta Lake, an open source storage layer that brings reliability, performance and security to your data...
View ArticleIntroducing Delta Time Travel for Future Data Sets
We are thrilled to introduce enhanced time travel capabilities in Databricks Delta, the next-gen unified analytics engine built on top of Apache Spark, for all of our users. With this new feature,...
View ArticleData Democratization: A Key to Building a Healthy Data Culture
Building a thriving data culture is a strategic priority for many organizations, but only 24% of enterprises have managed to forge a data culture. What is a thriving data culture anyway? In its purest...
View ArticleData + AI Summit Is Back
Data + AI Summit, the global event for the data community, returns May 24-28. We are thrilled to announce that registration for this free virtual event is now open! The future is open Data and AI are...
View ArticleFine-Grained Time Series Forecasting at Scale With Facebook Prophet and...
Advances in time series forecasting are enabling retailers to generate more reliable demand forecasts. The challenge now is to produce these forecasts in a timely manner and at a level of granularity...
View ArticleBenchmark: Koalas (PySpark) and Dask
Koalas is a data science library that implements the pandas APIs on top of Apache Spark so data scientists can use their favorite APIs on datasets of all sizes. This blog post compares the performance...
View ArticleEfficiently Building ML Models for Predictive Maintenance in the Oil and Gas...
Guest authored post by Halliburton’s Varun Tyagi, Data Scientist, and Daili Zhang, Principal Data Scientist, as part of the Databricks Guest Blog Program Halliburton is an oil field services company...
View ArticleIdentifying Financial Fraud With Geospatial Clustering
For most financial service institutions (FSI), fraud prevention often implies a complex ecosystem made of various components –- a mixture of traditional rules-based controls and artificial intelligence...
View ArticleDatabricks and University of Rochester
At Databricks, we strongly believe (“know” you could say) that data and AI are mission-critical for solving the biggest problems our world faces. From healthcare to sustainability to transportation,...
View Article7 Reasons to Learn PyTorch on Databricks
What expedites the process of learning new concepts, languages or systems? When learning a new task, do you look for analogs from skills you already possess? Across all learning endeavors, three...
View ArticleHow (Not) to Tune Your Model with Hyperopt
Hyperopt is a powerful tool for tuning ML models with Apache Spark. Read on to learn how to define and execute (and debug) the tuning optimally! So, you want to build a model. You’ve solved the harder...
View ArticleAttack of the Delta Clones (Against Disaster Recovery Availability Complexity)
Notebook: Using Deep Clone for Disaster Recovery with Delta Lake on Databricks For most businesses, the creation of a business continuity plan is crucial to ensure vital services, such as data stores,...
View ArticlePrivate Databricks Workspaces With AWS PrivateLink Is in Public Preview
We’re excited to announce that PrivateLink connectivity for Databricks workspaces on AWS (Amazon Web Services) is now in public preview, with full support for production deployments. This release...
View ArticleHow We Launched a Podcast: Lessons, (Minor) Mishaps & Key Takeaways
After six episodes featuring amazing leaders and practitioners in the data and AI community, we wrapped up season 1 of Data Brew by Databricks, our homegrown podcast hosted by us two – Denny and...
View ArticleReproduce Anything: Machine Learning Meets Lakehouse
Machine learning has proved to add unprecedented value to organization and projects – whether that’s for accelerating innovation, personalization, demand forecasting and countless other use cases....
View ArticleA Guide to Data + AI Summit Sessions: Machine Learning, Data Engineering,...
We are only a few weeks away from Data + AI Summit, returning May 24–28. If you haven’t signed up yet, take advantage of free registration for five days of virtual engagement: training, talks, meetups,...
View ArticleDatabricks Named Data Science & Analytics Launch Partner for New AWS for...
“Digital transformation” isn’t just a buzzword – especially in the media and entertainment industry. More than just a more efficient way of creating or distributing content, the move to the cloud for...
View Article