Databricks

↧

Image may be NSFW.
Clik here to view.

Databricks Launches Second MOOC: Scalable Machine Learning

June 29, 2015, 8:36 am

We have been working in collaboration with professors at UC Berkeley and UCLA to produce two freely available Massive Open Online Courses (MOOCs). The first MOOC was released earlier this month and has...

View Article

Image may be NSFW.
Clik here to view.

MyFitnessPal Delivers New Feature, Speeds up Pipeline, and Boosts Team...

July 2, 2015, 8:20 am

To learn more about how Databricks helped MyFitnessPal with analytics, check out an earlier article in Wall Street Journal (log-in required) or download the case study. We are excited to announce that...

View Article

Image may be NSFW.
Clik here to view.

Guest blog: PMML Support in Spark MLlib

July 2, 2015, 1:08 pm

This is a guest blog from our friend Vincenzo Selvaggio. The recently released Apache Spark 1.4 introduces PMML support to MLlib for linear models and k-means clustering. This achievement is the result...

View Article

Image may be NSFW.
Clik here to view.

New Visualizations for Understanding Spark Streaming Applications

July 8, 2015, 8:23 am

Earlier, we presented new visualizations introduced in Spark 1.4.0 to understand the behavior of Spark applications. Continuing the theme, this blog highlights new visualizations introduced...

View Article

Image may be NSFW.
Clik here to view.

Announcing SparkHub: A Community Site for Apache Spark

July 10, 2015, 8:00 am

Today, we are happy to announce SparkHub (http://sparkhub.databricks.com), a service for the Apache Spark™ community to easily find the most relevant Spark resources on the web. SparkHub contains the...

View Article

Image may be NSFW.
Clik here to view.

Introducing R Notebooks in Databricks

July 13, 2015, 8:06 am

Spark 1.4 was released on June 11 and one of the exciting new features was SparkR. I am happy to announce that we now support R notebooks and SparkR in Databricks, our hosted Spark service. Databricks...

View Article

Image may be NSFW.
Clik here to view.

Introducing Window Functions in Spark SQL

July 15, 2015, 9:09 am

In this blog post, we introduce the new window function feature that was added in Spark 1.4. Window functions allow users of Spark SQL to calculate results such as the rank of a given row or a moving...

View Article

Image may be NSFW.
Clik here to view.

Joint Blog Post: Bringing ORC Support into Apache Spark

July 16, 2015, 9:00 am

This is a joint blog post with our partner Hortonworks. Zhan Zhang is a member of technical staff at Hortonworks, where he collaborated with the Databricks team on this new feature. In version 1.2.0,...

View Article

Image may be NSFW.
Clik here to view.

Be Heard with the Spark Survey

July 21, 2015, 8:00 am

At Databricks, we are constantly working to improve Apache Spark. To help us and the Spark community, we would love to hear from you to help set Spark’s future direction. A recent example of the...

View Article

Image may be NSFW.
Clik here to view.

Yesware Deploys Production Data Pipeline in Record Time with Databricks

July 23, 2015, 6:00 am

We are happy to announce that Yesware chose Databricks to build its production data pipeline, completing the project in record time — in just under three weeks. Press release:...

View Article

Image may be NSFW.
Clik here to view.

Using 3rd Party Libraries in Databricks: Spark Packages and Maven Libraries

July 28, 2015, 10:11 am

In an earlier post, we described how you can easily integrate your favorite IDE with Databricks to speed up your application development. In this post, we will show you how to import 3rd party...

View Article

Image may be NSFW.
Clik here to view.

New Features in Machine Learning Pipelines in Spark 1.4

July 29, 2015, 9:11 am

Spark 1.2 introduced Machine Learning (ML) Pipelines to facilitate the creation, tuning, and inspection of practical ML workflows. Spark’s latest release, Spark 1.4, significantly extends the ML...

View Article

Image may be NSFW.
Clik here to view.

Diving into Spark Streaming’s Execution Model

July 30, 2015, 9:15 am

With so many distributed stream processing engines available, people often ask us about the unique benefits of Spark Streaming. From early on, Apache Spark has provided an unified engine that natively...

View Article

Image may be NSFW.
Clik here to view.

Guest blog: SequoiaDB Connector for Apache Spark

August 3, 2015, 8:58 am

This is a guest blog from Tao Wang at SequoiaDB. He is the co-founder and CTO of SequoiaDB, leading its long-term technology vision, and is responsible for the leadership of advanced technology...

View Article

Image may be NSFW.
Clik here to view.

Helping the Democratization of Big Data

August 5, 2015, 8:22 am

When we started Databricks, we thought that extracting insights from big data was insanely difficult for no good reason. You almost needed an advanced degree to be able to get any meaningful work done....

View Article

Image may be NSFW.
Clik here to view.

Announcing the Databricks Academic Partners Program

August 11, 2015, 8:07 am

Databricks was born from academic research and today we are giving back to the academic community with the Databricks Academic Partners program. This program will provide academic instructors and...

View Article

Image may be NSFW.
Clik here to view.

From Pandas to Apache Spark’s DataFrame

August 12, 2015, 9:23 am

This is a cross-post from the blog of Olivier Girardot. Olivier is a software engineer and the co-founder of Lateral Thoughts, where he works on Machine Learning, Big Data, and DevOps solutions. With...

View Article

Image may be NSFW.
Clik here to view.

Spark 1.5 Preview Now Available in Databricks

August 18, 2015, 8:39 am

We are excited to announce that starting today, Apache Spark 1.5.0 is available as a preview in Databricks. Our users can now choose to provision clusters with Spark 1.5 or previous Spark versions...

View Article

Image may be NSFW.
Clik here to view.

Spark Summit Europe Full Agenda available online

August 31, 2015, 12:18 pm

This October, join the Apache Spark community in Amsterdam at the Beurs Van Berlage for the very first Spark Summit in Europe! We are happy to announce that the full agenda is now finalized, you can...

View Article

Image may be NSFW.
Clik here to view.

Announcing Spark 1.5

September 9, 2015, 12:51 am

The inaugural Spark Summit Europe will be held in Amsterdam this October. Check out the full agenda and get your ticket before it sells out! Today we are happy to announce the availability of Apache...

View Article