Databricks

Improving Threat Detection in a Big Data World

December 4, 2017, 2:28 pm

High-profile cybersecurity breaches dominated headlines in 2017. In the first half of the year, over 1.9B records were stolen. That’s more than 7,000 records breached every minute. And the fallout from...

View Article

Spark Summit is Becoming the Spark + AI Summit

December 6, 2017, 2:04 pm

We’re excited to announce that Spark Summit is expanding its coverage in 2018 to include in-depth content on artificial intelligence. We are also renaming the conference Spark + AI Summit. AI has...

View Article

The Architecture of the Next CERN Accelerator Logging Service

December 14, 2017, 8:24 am

This is a community guest blog from Jakub Wozniak, a software engineer and project technical lead at CERN physics laboratory, further expounding and complementing his keynote at Spark Summit EU in...

View Article

Overstock Marketing + Databricks = Data Science at Scale

December 18, 2017, 6:01 am

This is a guest post from Chris Robison, Head of Marketing Data Science at Overstock.com. At Overstock.com we’ve never had a problem with a lack of data. At 19 years old, we have one of the most...

View Article

Unifying People Processes and Platform – The Movie

December 20, 2017, 11:38 am

Today we released our Databricks Unified Analytics Platform video. This short video illustrates to analytics leaders how Databricks can unify their analytics efforts onto one platform. This unification...

View Article

Databricks and Apache Spark 2017 Year in Review

January 3, 2018, 8:21 am

At Databricks we welcome the dawn of the New Year 2018 by reflecting on what we achieved collectively as a company and community in 2017. In this blog, we elaborate on the three themes: unification,...

View Article

Databricks Cache Boosts Apache Spark Performance

January 9, 2018, 8:45 am

We are excited to announce the general availability of Databricks Cache, a Databricks Runtime feature as part of the Unified Analytics Platform that can improve the scan speed of your Apache Spark...

View Article

Meltdown and Spectre’s Performance Impact on Big Data Workloads in the Cloud

January 13, 2018, 7:48 am

Last week, the details of two industry-wide security vulnerabilities, known as Meltdown and Spectre, were released. These exploits enable cross-VM and cross-process attacks by allowing untrusted...

View Article

Meltdown and Spectre: Exploits and Mitigation Strategies

January 16, 2018, 10:25 pm

In an earlier blog post, we analyzed the performance impact of Meltdown and Spectre on big data workloads in the cloud. In this blog post, we explain these exploits, their mitigation strategies and...

View Article

Matei Zaharia’s 5 predictions about AI in 2018

January 17, 2018, 5:10 pm

Over the past few years, the demand for artificial intelligence (AI) and machine learning capabilities has surged with innovations in natural language processing, task automation, and predictions. From...

View Article

Accelerate Innovation with Microsoft Azure Databricks

January 22, 2018, 4:32 pm

It’s hard to believe that we are already three weeks into 2018. If you’re still struggling to get valuable insights from your data, now is the perfect time to try something new! We recently announced...

View Article

Introducing Apache Spark 2.3

February 28, 2018, 2:19 pm

Today we are happy to announce the availability of Apache Spark 2.3.0 on Databricks as part of its Databricks Runtime 4.0. We want to thank the Apache Spark community for all their valuable...

View Article

Apache Spark 2.3 with Native Kubernetes Support

March 6, 2018, 12:27 pm

This is a community blog from Anirudh Ramanathan and Palak Bhatia, software engineer and product manager respectively at Google, working in the Kubernetes team. They are part of the group of companies...

View Article

Announcing Machine Learning Model Export in Databricks

March 7, 2018, 11:36 am

In recent years, machine learning has become ubiquitous in industry and production environments. Both academic and industry institutions had previously focused on training and producing models, but the...

View Article

Introducing Stream-Stream Joins in Apache Spark 2.3

March 13, 2018, 7:59 am

Since we introduced Structured Streaming in Apache Spark 2.0, it has supported joins (inner join and some types of outer joins) between a streaming and a static DataFrame/Dataset. With the release of...

View Article

Selected Sessions to Watch for at Spark + AI Summit 2018

March 15, 2018, 10:03 am

Early last month, we announced our agenda for Spark + AI Summit 2018, with over 180 selected talks across 11 tracks and training courses. For this summit, we have added four new tracks to expand its...

View Article

Introducing Low-latency Continuous Processing Mode in Structured Streaming in...

March 20, 2018, 9:25 am

Structured Streaming in Apache Spark 2.0 decoupled micro-batch processing from its high-level APIs for a couple of reasons. First, it made developer’s experience...

View Article

Azure Databricks, industry-leading analytics platform powered by Apache Spark™

March 22, 2018, 9:00 am

The confluence of cloud, data, and AI is driving unprecedented change. The ability to utilize data and turn it into breakthrough insights is foundational to innovation today. Our goal is to empower...

View Article

Introducing Click: The Command Line Interactive Controller for Kubernetes

March 27, 2018, 9:07 am

Click is an open-source tool that lets you quickly and easily run commands against Kubernetes resources, without copy/pasting all the time, and that easily integrates into your existing command line...

View Article

Introducing Data Brick™: The Building Block of DataBricks’ Unified Analytics...

April 1, 2018, 6:02 am

As a digital society built around data and devices, we have reached a pivotal juncture where data and Artificial Intelligence must be accessible to everyone. Riding this trend, many homes now contain...

View Article