A Tale About Vulnerability Research and Early Detection
This is a collaborative post between Databricks and Orca Security. We thank Yanir Tsarimi, Cloud Security Researcher, of Orca Security for their contribution. Databricks’ number one priority is the...
OMB M-21-31: A Cost-Effective Alternative to Meeting and Exceeding...
On August 27, 2021, the U.S. Office of Management and Budget (OMB) released a memo in accordance with the Biden Administration’s Executive Order (EO) 14028, Improving the Nation’s Cybersecurity. While...
Saving Time and Costs With Cluster Reuse in Databricks Jobs
With our launch of Jobs Orchestration, orchestrating pipelines in Databricks has become significantly easier. The ability to separate ETL or ML pipelines over multiple tasks offers a number of...
How Butcherbox Uses Data Insights to Provide Quality Food Tailored to Each...
This is a guest-authored post by Jake Stone, Senior Manager, Business Analytics at ButcherBox. The impact of a legacy data warehouse on business speed and agility: From the outside, the ButcherBox...
Structured Streaming: A Year in Review
As we enter 2022, we want to take a moment to reflect on the great strides made on the streaming front in Databricks and Apache Spark™! In 2021, the engineering team and open source contributors made...
Simplify Your Forecasting With Databricks AutoML
Last year, we announced Databricks AutoML for Classification and Regression and showed the importance of having a glass box approach to empower data teams. Today, we are happy to announce that we’re...
Using Apache Flink With Delta Lake
As with all parts of our platform, we are constantly raising the bar and adding new features to enhance developers’ abilities to build the applications that will make their Lakehouse a reality....
Databricks Delta Live Tables Announces Support for Simplified Change Data...
As organizations adopt the data lakehouse architecture, data engineers are looking for efficient ways to capture continually arriving data. Even with the right tools, implementing this common use case...
A Breakup Letter to Data Warehouses
Dear Data Warehouse, We have been trying to make it work for a long time, some would say too long, and it’s just not working anymore. I want to say “it’s not you, it’s me”, but actually – it is you....
Deploy Production Pipelines Even Easier With Python Wheel Tasks
With its rich open source ecosystem and approachable syntax, Python has become the main programming language for data engineering and machine learning. Data and ML engineers already use Databricks to...
Lakehouse for Financial Services: Paving the Way for Data-Driven Innovation...
When it comes to “data-driven innovation,” financial services institutions (FSIs) aren’t what typically come to mind. But with massive amounts of data potentially at their disposal, this isn’t for lack of...
How Gemini Built a Cryptocurrency Analytics Platform Using Lakehouse for...
This blog has been co-authored by Gemini. We would like to thank the Gemini team, Anil Kovvuri and Sriram Rajappa, for their contributions. Gemini is one of the top centralized cryptocurrency...
Beyond LDA: State-of-the-art Topic Models With BigARTM
This post follows up on the series of posts in Topic Modeling for text analytics. Previously, we looked at the LDA (Latent Dirichlet Allocation) topic modeling library available within MLlib in...
Databricks Ventures Invests in Arcion to Enable Real-Time Data Sync with the...
Databricks customers, regardless of size and industry, are increasingly seeking to unify their data onto a single platform. To do this, they need a simple, scalable and performant solution for moving...
Get to Know Your Queries With the New Databricks SQL Query Profile!
Databricks SQL provides data warehousing capabilities and first-class support for SQL on the Databricks Lakehouse Platform, allowing analysts to discover and share new insights faster at a fraction of...
Databricks Ventures Partners With dbt Labs to Welcome Analytics Engineers to...
Today, we are thrilled to announce Databricks Ventures’ investment in dbt Labs. With this investment, we are proud to support the growth of the company behind a pivotal open source movement. Alongside...
Building a Similarity-based Image Recommendation System for e-Commerce
Why recommendation systems are important: Online shopping has become the default experience for the average consumer – even established brick-and-mortar retailers have embraced e-commerce. To ensure a...
Hyper-Personalization Accelerator for Banks and Fintechs Using Credit Card...
Just as Netflix and Tesla disrupted the media and automotive industries, many fintech companies are transforming the financial services industry by winning the hearts and minds of a digitally active...
Enabling Zero Trust in the NOC With Databricks and Immuta
This post was written in collaboration with Databricks partner Immuta. We thank Sam Carroll, Partner Solutions Architect, Immuta, for his contributions. Imagine you are a NOC/SOC analyst in a...
Introducing Lakehouse for Healthcare and Life Sciences
Each of us will likely generate millions of gigabytes of health data in our lifetimes: medical and pharmacy claims, electronic medical records with extensive clinical documentation, medical images;...