Databricks

↧

Image may be NSFW.
Clik here to view.

Announcing a New Redash Connector for Databricks

May 28, 2020, 8:21 am

We’re happy to introduce a new, open source connector with Redash, a cloud-based SQL analytics service, to make it easy to query data lakes with Databricks. Traditionally, data analyst teams face...

View Article

Image may be NSFW.
Clik here to view.

Adaptive Query Execution: Speeding Up Spark SQL at Runtime

May 29, 2020, 7:00 am

This is a joint engineering effort between the Databricks Apache Spark engineering team — Wenchen Fan, Herman van Hovell and MaryAnn Xue — and the Intel engineering team — Ke Jia, Haifeng Chen and...

View Article

Image may be NSFW.
Clik here to view.

Vectorized R I/O in Upcoming Apache Spark 3.0

June 1, 2020, 7:00 am

R is one of the most popular computer languages in data science, specifically dedicated to statistical analysis with a number of extensions, such as RStudio addins and other R packages, for data...

View Article

Image may be NSFW.
Clik here to view.

Monitor Your Databricks Workspace with Audit Logs

June 2, 2020, 8:22 am

Cloud computing has fundamentally changed how companies operate – users are no longer subject to the restrictions of on-premises hardware deployments such as physical limits of resources and onerous...

View Article

Image may be NSFW.
Clik here to view.

Customer Lifetime Value Part 1: Estimating Customer Lifetimes

June 3, 2020, 8:00 am

Download the Customer Lifetimes Part 1 notebook to demo the solution covered below. The biggest challenge every marketer faces is how to best spend money to profitably grow their brand. We want to...

View Article

Image may be NSFW.
Clik here to view.

How the Minnesota Twins Scaled Pitch Scenario Analysis to Measure Player...

June 4, 2020, 8:14 am

Statistical Analysis in the Game of Baseball A single pitch in Major League Baseball (MLB) generates tens of megabytes of data, from pitch movement to ball rotation to hitter behavior to the movement...

View Article

Image may be NSFW.
Clik here to view.

Automate continuous integration and continuous delivery on Databricks using...

June 5, 2020, 7:00 am

CONTENTS Overview Why do we need yet another deployment framework? Simplifying CI/CD on Databricks via reusable templates Development lifecycle using Databricks Deployments How to create and deploy a...

View Article

Image may be NSFW.
Clik here to view.

Modernizing Risk Management Part 2: Aggregations, Backtesting at Scale and...

June 5, 2020, 8:34 am

Understanding and mitigating risk is at the forefront of any financial services institution. However, as previously discussed in the first blog of this two-part series, banks today are still struggling...

View Article

Image may be NSFW.
Clik here to view.

Accelerating developers by ditching the data center

June 10, 2020, 7:00 am

Guest blog by R Tyler Croy, Director of Platform Engineering at Scribd People don’t tend to get excited about the data platform. It is often regarded much like road infrastructure: nobody thinks much...

View Article

Image may be NSFW.
Clik here to view.

Data Teams Unite! Countdown to Spark + AI Summit

June 10, 2020, 5:41 pm

Spark + AI Summit 2020 is now virtual and free! June 22-26 is just around the corner and the excitement is building! More sessions. More speakers. 4x More training. And more of the world’s data...

View Article

Image may be NSFW.
Clik here to view.

Media and Entertainment Sessions You Don’t Want to Miss at Spark + AI Summit...

June 11, 2020, 10:20 am

For years, the Spark + AI Summit has been the premier meeting place for organizations looking to build artificial intelligence (AI) applications at scale with leading technologies such as Apache...

View Article

Image may be NSFW.
Clik here to view.

Financial Services Sessions You Don’t Want to Miss at Spark + AI Summit 2020

June 11, 2020, 11:01 am

Radical transformation is the theme of 2020, with customers demanding personalized products, improved protection against fraud, and digital experiences that match every small shift in behavior. Banks,...

View Article

Image may be NSFW.
Clik here to view.

A Guide to the MLflow Talk at Spark + AI Summit 2020

June 12, 2020, 7:00 am

It’s been 2 years since we originally launched MLflow, an open source platform for the full machine learning lifecycle, and we are thrilled and humbled by the adoption and impact it has gained in the...

View Article

Image may be NSFW.
Clik here to view.

Enterprise Cloud Service Public Preview on AWS

June 12, 2020, 7:30 am

At Databricks, we have had the opportunity to collaborate with companies that have transformed the way people live. Some of our customers have developed life saving drugs, delivered industry-first user...

View Article

Image may be NSFW.
Clik here to view.

Accelerating Somatic Variant Calling with the Databricks TNSeq Pipeline

June 15, 2020, 7:00 am

Genetic analyses are a critical tool in revolutionizing how we treat cancer. By understanding the mutations present in tumor cells, researchers can gain clues that lead to drug targets and eventually...

View Article

Simplify Data Conversion from Apache Spark to TensorFlow and PyTorch

June 16, 2020, 7:00 am

Petastorm is a popular open-source library from Uber that enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. We are excited to...

View Article

Image may be NSFW.
Clik here to view.

Healthcare and Life Sciences Sessions You Don’t Want to Miss at Spark + AI...

June 16, 2020, 10:17 am

The healthcare industry is in a rapid state of change. The COVID-19 pandemic has shined a light on how critical it is for healthcare payers, providers, pharmaceutical companies and government agencies...

View Article

Image may be NSFW.
Clik here to view.

Retail and Consumer Goods Sessions You Don’t Want to Miss at Spark + AI...

June 16, 2020, 12:22 pm

The current economic environment is having a significant impact on the Retail and Consumer Goods sector. Rapid changes in how consumers shop is forcing companies to rethink their sales, marketing, and...

View Article

On-Demand Virtual Session: Customer Lifetime Value

June 16, 2020, 1:19 pm

Before you can provide personalized services and offers to your customers, you need to know who they are. In this virtual workshop, retail and media experts will demonstrate how to build advanced...

View Article

Image may be NSFW.
Clik here to view.

Simplify Python environment management on Databricks Runtime for Machine...

June 17, 2020, 8:11 am

Today we announce the release of %pip and %conda notebook magic commands to significantly simplify python environment management in Databricks Runtime for Machine Learning. With the new magic commands,...

View Article