Delta Live Tables Announces New Capabilities and Performance Optimizations
Since the availability of Delta Live Tables (DLT) on all clouds in April (announcement), we’ve introduced new features to make development easier, enhanced automated infrastructure management,...
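As a hedged illustration of what a DLT pipeline definition looks like, here is a minimal Python sketch; the `dlt` module and `spark` session are provided inside a DLT pipeline, and the source path and column name are hypothetical placeholders.

```python
# Minimal Delta Live Tables sketch (runs inside a DLT pipeline, where `spark`
# and the `dlt` module are provided). The landing path and column name are
# hypothetical placeholders.
import dlt
from pyspark.sql.functions import col

@dlt.table(comment="Raw events ingested incrementally with Auto Loader.")
def raw_events():
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("/data/events/raw")  # hypothetical landing path
    )

@dlt.table(comment="Cleaned events ready for analytics.")
def clean_events():
    return dlt.read_stream("raw_events").where(col("event_type").isNotNull())
```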
Introducing MLflow Pipelines with MLflow 2.0
Since we launched MLflow in 2018, MLflow has become the most popular MLOps framework, with over 11M monthly downloads! Today, teams of all sizes use MLflow to track, package, and deploy models....
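To make the track/package/deploy workflow concrete, here is a minimal MLflow tracking sketch; the dataset and model are illustrative stand-ins, not part of the announcement.

```python
# Minimal MLflow tracking sketch: log parameters, a metric, and a model
# artifact for a single training run. The dataset and model are illustrative.
import mlflow
import mlflow.sklearn
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

X, y = make_classification(n_samples=500, random_state=42)

with mlflow.start_run():
    model = LogisticRegression(C=0.5, max_iter=200).fit(X, y)
    mlflow.log_param("C", 0.5)
    mlflow.log_metric("train_accuracy", accuracy_score(y, model.predict(X)))
    mlflow.sklearn.log_model(model, "model")  # packaged for later deployment
```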
Designing a Java Connector for Delta Sharing Recipient
Making an open data marketplace: Stepping into this brave new digital world, we are certain that data will be a central product for many organizations. The way to convey their knowledge and their assets...
Recap of Databricks Machine Learning announcements from Data & AI Summit
Databricks Machine Learning on the lakehouse provides end-to-end machine learning capabilities from data ingestion and training to deployment and monitoring, all in one unified experience, creating a...
Open Sourcing All of Delta Lake
The theme of this year’s Data + AI Summit is that we are building the modern data stack with the lakehouse. A fundamental requirement of your data lakehouse is the need to bring reliability to your...
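As a hedged aside, a minimal PySpark sketch of writing and reading a Delta table; it assumes the open source Delta Lake libraries are configured on the Spark session, and the path is a placeholder.

```python
# Minimal sketch of writing and reading an open-format Delta table with
# PySpark. Assumes Delta Lake is configured on the session; the path is a
# hypothetical placeholder.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("delta-example").getOrCreate()

df = spark.createDataFrame([(1, "bronze"), (2, "silver")], ["id", "tier"])
df.write.format("delta").mode("overwrite").save("/tmp/delta/tiers")

# Reads see a consistent, transactionally committed snapshot of the table.
spark.read.format("delta").load("/tmp/delta/tiers").show()
```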
Introducing Spark Connect – The Power of Apache Spark, Everywhere
At last week’s Data and AI Summit, we highlighted a new project called Spark Connect in the opening keynote. This blog post walks through the project’s motivation, high-level proposal, and next steps....
Using Airbyte for Unified Data Integration Into Databricks
Today, we are thrilled to announce a native integration with Airbyte Cloud, which allows data replication from any source into Databricks for all data, analytics, and ML workloads. Airbyte Cloud, a...
Databricks Ventures Invests in Tecton: An Enterprise Feature Platform for the...
Operational machine learning, which involves applying machine learning to customer-facing applications or business operations, requires solving complex data problems. Data teams need to turn raw data...
6 Guiding Principles to Build an Effective Data Lakehouse
In this blog post, we will discuss some guiding principles to help you build a highly effective and efficient data lakehouse that delivers on modern data and AI needs to achieve your business goals. If...
Using Spark Structured Streaming to Scale Your Analytics
This is a guest post from the M Science Data Science & Engineering Team. Modern data doesn’t stop growing: “Engineers are taught by life experience that doing something quick and doing something...
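As a hedged sketch of the pattern, here is a minimal Structured Streaming job that incrementally aggregates an event stream and writes the results to a Delta table; the paths and column names are hypothetical placeholders, not M Science's pipeline.

```python
# Minimal Structured Streaming sketch: incrementally aggregate an event stream
# and write the results to a Delta table. Paths and column names are
# hypothetical placeholders.
from pyspark.sql import SparkSession
from pyspark.sql.functions import window, count

spark = SparkSession.builder.appName("streaming-analytics").getOrCreate()

# Read a Delta table as a streaming source.
events = spark.readStream.format("delta").load("/data/events")

# Windowed counts per event type, with a watermark to bound state.
counts = (
    events.withWatermark("event_time", "10 minutes")
    .groupBy(window("event_time", "5 minutes"), "event_type")
    .agg(count("*").alias("events"))
)

# Write the aggregates incrementally to a Delta sink.
(
    counts.writeStream.format("delta")
    .outputMode("append")
    .option("checkpointLocation", "/chk/event_counts")
    .start("/data/event_counts")
)
```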
Hunting for IOCs Without Knowing Table Names or Field Labels
There is a breach! You are an infosec incident responder and you get called in to investigate. You show up and start asking people for network traffic logs and telemetry data. People start sharing...
Disaster Recovery Automation and Tooling for a Databricks Workspace
This post is a continuation of the Disaster Recovery Overview, Strategies, and Assessment blog. Introduction: A broad ecosystem of tooling exists to implement a Disaster Recovery (DR) solution. While no...
Scanning for Arbitrary Code in Databricks Workspace With Improved Search and...
How can we tell whether our users are using a compromised library? How do we know whether our users are using that API? These are the types of questions we regularly receive from our customers. Given...
Building a Cybersecurity Lakehouse for CrowdStrike Falcon Events Part II
Visibility is critical when it comes to cyber defense – you can’t defend what you can’t see. In the context of a modern enterprise environment, visibility refers to the ability to monitor and account...
Sync Your Customer Data to the Databricks Lakehouse Platform With RudderStack
Collecting, storing, and processing customer event data involves unique technical challenges. It’s high volume, noisy, and constantly changing. In the past, these challenges led many companies to...
Databricks SQL Highlights From Data & AI Summit
Data warehouses are not keeping up with today’s world: the explosion of languages other than SQL, unstructured data, machine learning, IoT and streaming analytics has forced customers to adopt a...
Parallel ML: How Compass Built a Framework for Training Many Machine Learning...
This is a collaborative post from Databricks and Compass. We thank Sujoy Dutta, Senior Machine Learning Engineer at Compass, for his contributions. As a global real estate company, Compass processes...
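One common pattern for this kind of parallel, per-group training on Spark is a grouped pandas UDF; the sketch below is a generic illustration under that assumption, not necessarily the framework Compass built, and the table and column names are hypothetical.

```python
# Generic pattern for training many independent models in parallel on Spark:
# group the data by a key and fit one model per group with applyInPandas.
# Table and column names are hypothetical.
import pandas as pd
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, StringType, DoubleType
from sklearn.linear_model import LinearRegression

spark = SparkSession.builder.appName("parallel-training").getOrCreate()

result_schema = StructType([
    StructField("region", StringType()),
    StructField("r2", DoubleType()),
])

def train_one_group(pdf: pd.DataFrame) -> pd.DataFrame:
    # Fit an independent model on this group's slice of the data.
    model = LinearRegression().fit(pdf[["sqft"]], pdf["price"])
    score = model.score(pdf[["sqft"]], pdf["price"])
    return pd.DataFrame({"region": [pdf["region"].iloc[0]], "r2": [score]})

listings = spark.read.table("listings")  # hypothetical input table
scores = listings.groupBy("region").applyInPandas(train_one_group, schema=result_schema)
```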
How the Lakehouse Empowered Rogers Communications to Modernize Revenue Assurance
This is a guest post from Duane Robinson, Sr. Manager of Data Science at Rogers Communications. At Rogers Communications, we take pride in ensuring billing accuracy and integrity for our customers....
Key Retail & Consumer Goods Takeaways From Data + AI Summit 2022
Retail and Consumer Goods companies showed up big at Data + AI Summit this year! From incredible breakout sessions to a keynote and panel of top retail speakers like the VP of Ads Engineering at...
Power to the SQL People: Introducing Python UDFs in Databricks SQL
We were thrilled to announce the preview for Python User-Defined Functions (UDFs) in Databricks SQL (DBSQL) at last month’s Data and AI Summit. This blog post gives an overview of the new capability...
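As a hedged illustration of the capability, the sketch below defines and calls a Python UDF using the CREATE FUNCTION ... LANGUAGE PYTHON syntax, submitted via spark.sql() from a notebook for convenience; it assumes a Unity Catalog-enabled workspace, and the catalog, schema, function name, and body are illustrative.

```python
# Sketch of defining and calling a Python UDF with the SQL
# CREATE FUNCTION ... LANGUAGE PYTHON syntax, submitted here via spark.sql()
# in a Databricks notebook (where `spark` is provided). Assumes a Unity
# Catalog-enabled workspace; the catalog, schema, and function are illustrative.
spark.sql("""
CREATE OR REPLACE FUNCTION main.default.redact_email(email STRING)
RETURNS STRING
LANGUAGE PYTHON
AS $$
  user, _, domain = email.partition("@")
  return user[:1] + "***@" + domain
$$
""")

spark.sql(
    "SELECT main.default.redact_email('jane.doe@example.com') AS redacted"
).show()
```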