Big data [Spark] and its small files problem

Often we log data in JSON, CSV or other text formats to Amazon's S3 as compressed files. This pattern is a) accessible and b) infinitely scalable by nature of living in S3 as plain text files. However, there are some subtle but critical caveats that come with this pattern and can cause quite a bit of trouble. Here I'm going to discuss the "small files problem" that has existed in big data since Hadoop and MapReduce first came to prominence, along with advice on how to address it.

Consider the following spark command:

df = spark.read.json("s3://awesome-bucket/offers/2017/09/07/*")

That statement looks simple and innocuous enough: tell Spark to read in a day's worth of JSON-formatted files in awesome-bucket under the offers key "directory."*

The "*" glob at the end of the path means we'll be reading the files in each hour "directory", each of which contains over 30,000 files. In total, there are 24 "directories" holding over 700,000 files and 72M rows, with each file weighing in at ~30KB.

But… there's a lot going on under the hood, and none of it is good for performance:

1. S3 is not a file system
2. Meta data on tiny files
3. GZip compressed files
4. JSON

1. S3 is not a file system

Amazon's Simple Storage Service (S3) is a "cloud-based object storage solution" where each object is identified by a bucket and a key. When you list a directory on your local computer (e.g. "ls /tmp" on *nix systems), information about the directory and its files comes back immediately. Asking S3 to list all the files in a "directory" (hint: it's not actually a directory), such as through "s3cmd ls s3://bucket/path/to/files", returns results in seconds at best, possibly minutes. S3 works well when you have a few large files but is horrendous when you have an army of tiny files, because more files means the listing process takes substantially longer.
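To see this for yourself, you can time a raw listing call from the Spark shell. This is just a sketch: it reuses the hypothetical awesome-bucket layout from above and assumes the S3A filesystem and credentials are already configured.

import java.net.URI
import org.apache.hadoop.fs.{FileSystem, Path}

// grab the Hadoop FileSystem that backs s3a:// paths for this session
val fs = FileSystem.get(new URI("s3a://awesome-bucket"), spark.sparkContext.hadoopConfiguration)

// time a listing of a single hour "directory" (tens of thousands of tiny objects)
val start = System.nanoTime()
val statuses = fs.listStatus(new Path("s3a://awesome-bucket/offers/2017/09/07/00/"))
println(s"Listed ${statuses.length} objects in ${(System.nanoTime() - start) / 1e9} seconds")

Run the same listing against a prefix holding a handful of large files and the difference is stark.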

2. Meta data on tiny files

Spark has to know the exact path of, and how to open, each and every file (e.g. s3://bucket/path/to/objects/object1.gz), even if you just pass a path to a "directory", because ultimately that "directory" is just a collection of files (or "objects"). With an army of tiny files, this meta data gets large, both in number of elements and in memory footprint (100,000+ records in a hashmap holding location, compression type and other meta data is not lightweight). Add in the overhead of using S3 plus network latencies and it becomes clearer why this meta data collection process takes a long time.

3. GZip Compressed Files

GZip compression is great. Sure, there are other compression formats out there (e.g. brotli, lzo, bzip2, etc), but few if any are as widespread and accessible as GZip. What GZip has in market share, though, it sacrifices in big-data-friendly features. GZip is not splittable, which means an entire file must be processed by a single core/executor as a single partition. That is an anti-pattern in a tool like Spark, designed explicitly around parallelism, because serially processing a file is expensive. Small GZip files are actually better to process than large ones, because multiple cores can work on different small files at the same time rather than sitting idle while one or two cores do all the work. But don't be tricked into thinking "oh, I'll just have loads of small files", because as you saw above, loads of small files are far worse than just about any alternative.
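A quick way to see the splittability problem in action from the Spark shell (a sketch; the path below is a hypothetical single large gzipped file):

// One big .json.gz file comes in as exactly one partition, so one core does all the work
val oneBigGz = spark.read.json("s3a://awesome-bucket/dumps/2017/09/07/big-dump.json.gz")
println(oneBigGz.rdd.getNumPartitions) // => 1, because GZip cannot be split

// Repartitioning lets downstream stages use the whole cluster,
// but the initial decompression and parse still happened on a single core
val spread = oneBigGz.repartition(64)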

4. JSON

It’s slow; plain and simple. It allows for complex structures and data types as well as implicitly defined types, which all make it very
expensive to parse.

* "Directory" on S3 is a misnomer, because S3 is an object store where each object is the combination of a bucket (e.g. "awesome-bucket") and a specific key (e.g. "/path/to/objects/object1.gz") that has the object's contents as its value. Think of it like a key-value store where "s3://awesome-bucket/offers/2017/09/07/12/2017-09-07-12-59-0.9429827.gz" is the KEY and the gzipped contents of that file are the VALUE. The slashes after the bucket name (awesome-bucket) mean nothing to S3; they exist solely for the user's convenience. Treating them as if they denote true directories is a convenience feature Amazon and various APIs offer.

What can we do about it?
Use file formats like Apache Parquet and ORC. If you do need to work with (and ideally convert in full) armies of small files, there are a few approaches you can use:
1) S3DistCP (Qubole calls it CloudDistCP)
2) Use Scala with Spark to take advantage of Scala's parallel collections and Spark's concurrent job submission. See below for an example.
3) Just wait. A long time.

Parallel Job Submission to Consolidate many “directories”

val hours = (0 to 23).map(h => "%02d".format(h)) // zero-pad the hour, e.g. "07"
hours.par.foreach(hour => {
  // consolidate each hour's worth of tiny JSON files into 16 Parquet files
  spark.read.json("s3a://awesome-bucket/offers/2017/09/07/" + hour + "/*")
    .repartition(16)
    .write
    .parquet("s3://output/" + hour) // per-hour output paths so the concurrent writes don't collide
})

If the code and explanation don't have you convinced, see this chart, which shows the outsized performance boost from using Parquet over armies of tiny files. The chart only looks at one hour's worth of data, on local HDFS instead of S3, so it actually makes the small files look better than they should. I expect a minimum of a 20x speedup from using optimal file formats.

For the exceptionally interested parties, here is the chunk of spark code that I believe handles collecting the meta data on input files:
https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala#L380 – note how it’s not parallelized, asynchronous or otherwise optimized for peak performance involving tiny files.

Spark File Format Showdown – CSV vs JSON vs Parquet

Apache Spark supports many different data sources, such as the ubiquitous Comma Separated Value (CSV) format and web API friendly JavaScript Object Notation (JSON) format. A common format used primarily for big data analytical purposes is Apache Parquet. Parquet is a fast columnar data format that you can read more about in two of my other posts: Real Time Big Data analytics: Parquet (and Spark) + bonus and Tips for using Apache Parquet with Spark 2.x

In this post we’re going to cover the attributes of using these 3 formats (CSV, JSON and Parquet) with Apache Spark.

Splittable (definition): Spark likes to split a single input file into multiple chunks (partitions, to be precise) so that it [Spark] can work on many partitions at one time (i.e. concurrently).

* CSV is splittable when it is a raw, uncompressed file or compressed with a splittable compression format such as BZIP2 or LZO (note: LZO needs to be indexed to be splittable!)

** JSON has the same splittability conditions as CSV when compressed, with one extra difference: when the "wholeFile" option is set to true (re: SPARK-18352), JSON is NOT splittable.

CSV should generally be the fastest to write, JSON the easiest for a human to understand and Parquet the fastest to read.

CSV is the de facto standard for a lot of data, and for fair reasons; it's [relatively] easy to comprehend for both users and computers, and it's made even more accessible via Microsoft Excel.

JSON is the standard for communicating on the web. APIs and websites are constantly communicating using JSON because it is human-readable and easy for both clients and servers to produce and parse.

Parquet is optimized for the Write Once Read Many (WORM) paradigm. It's slow to write, but incredibly fast to read, especially when you're only accessing a subset of the total columns. For use cases requiring operating on entire rows of data, a format like CSV, JSON or even Avro should be used.
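As a rough illustration (hypothetical paths and column name), the same single-column query looks identical in code, but the Parquet read only touches the one column it needs while the CSV read parses every row in full:

// CSV: every byte of every row is read and parsed before the projection happens
val csvCount = spark.read.option("header", "true").csv("s3a://bucket/events-csv/")
  .select("device_id").distinct().count()

// Parquet: only the device_id column chunks are read from storage
val parquetCount = spark.read.parquet("s3a://bucket/events-parquet/")
  .select("device_id").distinct().count()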

Code examples and explanations

CSV

Generic column names | all string types | lazily evaluated

scala> val df = spark.read.option("sep", "\t").csv("data.csv")
scala> df.printSchema
root
 |-- _c0: string (nullable = true)
 |-- _c1: string (nullable = true)
 |-- _c2: string (nullable = true)
 |-- _c3: string (nullable = true)

Header-defined column names | all string types | lazily evaluated

scala> val df = spark.read.option("sep", "\t").option("header","true").csv("data.csv")
scala> df.printSchema
root
 |-- guid: string (nullable = true)
 |-- date: string (nullable = true)
 |-- alphanum: string (nullable = true)
 |-- name: string (nullable = true)

Header-defined column names | inferred types | EAGERLY evaluated (!!!)

scala> val df = spark.read.option("sep", "\t").option("header","true").option("inferSchema","true").csv("data.csv")
scala> df.printSchema
root
 |-- guid: string (nullable = true)
 |-- date: timestamp (nullable = true)
 |-- alphanum: string (nullable = true)
 |-- name: string (nullable = true)

The eager evaluation of this version is critical to understand. In order to determine with certainty the proper data type for each column, Spark has to READ AND PARSE THE ENTIRE DATASET. That can be a very high cost, especially when the number of files/rows/columns is large. Schema inference also does no other work: none of your actual transformation code runs during that pass, so Spark ends up reading your file(s) TWICE instead of ONCE.
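The usual way around the double read is to hand Spark the schema yourself, so there is nothing to infer. A minimal sketch, reusing the column names and types from the sample above:

import org.apache.spark.sql.types._

val schema = StructType(Seq(
  StructField("guid", StringType),
  StructField("date", TimestampType),
  StructField("alphanum", StringType),
  StructField("name", StringType)
))

// No inference pass: the read stays lazy and the data is only scanned once
val df = spark.read
  .option("sep", "\t")
  .option("header", "true")
  .schema(schema)
  .csv("data.csv")

The same .schema(...) call works with spark.read.json(...), which sidesteps the eager evaluation described in the next section as well.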

JSON

Named columns | inferred types | EAGERLY evaluated
scala> val df = spark.read.json("data.json")
scala> df.printSchema
root
 |-- alphanum: string (nullable = true)
 |-- epoch_date: long (nullable = true)
 |-- guid: string (nullable = true)
 |-- name: string (nullable = true)

Like the schema-inferring CSV read above, JSON reads are eagerly evaluated: Spark scans the data up front to infer the schema before any of your transformations run.

Parquet

Named Columns | Defined types | lazily evaluated
scala> val df = spark.read.parquet("data.parquet")
scala> df.printSchema
root
 |-- alphanum: string (nullable = true)
 |-- date: long (nullable = true)
 |-- guid: string (nullable = true)
 |-- name: string (nullable = true)

Unlike CSV and JSON, Parquet files are binary files that contain meta data about their contents. Without needing to read/parse the contents of the file(s), Spark can rely on Parquet's own meta data to determine column names and data types.

 TL;DR Use Apache Parquet instead of CSV or JSON whenever possible, because it’s faster and better.

Using Spark Efficiently | Understanding Spark Event 7/29/17

This page is dedicated to resources related to the 7/29/17 Understanding Spark event presentation in Bellevue, WA.

Slides

Great [FREE!] resources on all things Spark:
https://jaceklaskowski.gitbooks.io/mastering-apache-spark/
https://spark.apache.org/docs/latest/sql-programming-guide.html

Databricks was founded by the original creators of Spark and is currently the largest contributor to Apache Spark. As such, they are a phenomenal resource for information and services relating to Spark.

Datasets: https://databricks.com/blog/2016/01/04/introducing-apache-spark-datasets.html
Catalyst: https://www.slideshare.net/databricks/a-deep-dive-into-spark-sqls-catalyst-optimizer-with-yin-huai
https://de.slideshare.net/SparkSummit/deep-dive-into-catalyst-apache-spark-20s-optimizer-63071120
Tungsten: https://databricks.com/blog/2015/04/28/project-tungsten-bringing-spark-closer-to-bare-metal.html
Matrix: https://databricks.com/blog/2016/07/14/a-tale-of-three-apache-spark-apis-rdds-dataframes-and-datasets.html

Personally curated Examples:

Create mock typed object data
import org.apache.spark.sql.functions._

case class CountryGDP(countryCode: String, countryName: String, Year: String, gdp: Double, language: Option[String])

val objects = Seq[CountryGDP](
  CountryGDP("USA", "'Murica", "2014", 17393103000000.0, None),
  CountryGDP("USA", "'Murica", "2015", 18036648000000.0, None),
  CountryGDP("USA", "'Murica", "2016", 18569100000000.0, None),
  CountryGDP("CHE", "Switzerland", "2014", 702705544908.583, None),
  CountryGDP("CHE", "Switzerland", "2015", 670789928809.882, None),
  CountryGDP("CHE", "Switzerland", "2016", 659827235193.83, None)
)

Strongly typed Datasets
val objectsDS = spark.createDataset(objects)

// typed objects are evaluated at compile time (great for development in IDEs!)
val countriesWithLanguages = objectsDS.map(o => {
  val lang = o.countryCode match {
    case "USA" => Some("English")
    case "CHE" => Some("Schweizerdeutsch")
    case _ => Some("Simlish")
  }
  o.copy(language = lang)
})

Creating DataFrame and using UDF to transform
val rowsDF = spark.createDataFrame(objects)

def getLang(countryCode: String): Option[String] = {
  countryCode match {
    case "USA" => Some("English")
    case "CHE" => Some("Schweizerdeutsch")
    case _ => Some("Simlish")
  }
}
val gl = spark.udf.register("getLang", getLang _)

// String-based lookups are evaluated at Runtime
val rowsDFWithLanguage = rowsDF.withColumn("language", gl($"countryCode"))

Event link: https://www.eventbrite.com/e/understanding-spark-tickets-35440866586#

Video recording is here: https://livestream.com/metis/events/7597562

Learn more about parquet

Switching between Scala and Python on Spark tips

Switching between Scala and Python on Spark is relatively straightforward, but there are a few differences that can cause some minor frustration. Here are some of the little things I’ve run into and how to adjust for them.

  • PySpark Shell does not support code completion (autocomplete) by default.

Why? PySpark uses the basic Python interpreter REPL, so you get the same REPL you’d get by calling python at the command line.

Fix: Use the iPython REPL by specifying the environment variable
PYSPARK_PYTHON=ipython3 before the pyspark command.

Before:
pyspark

After:
PYSPARK_PYTHON=ipython3 pyspark

  • val and var are not python keywords!

This is silly, but I catch myself trying to create variables in python regularly with the val df = spark.read... style.

Before:
>>> val df = spark.range(100)
File "", line 1
val df = spark.range(100)
^
SyntaxError: invalid syntax

After:
>>> df = spark.range(100)

  • It’s print not println

Just like the val/var conundrum, println is not a valid keyword in python, but print is!

Before:
In [5]: df.foreach(println)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
<ipython-input-5-3d51e5dc3e2b> in <module>()
----> 1 df.foreach(println)

NameError: name 'println' is not defined

After:
In [6]: df.foreach(print)
Row(id=3)
Row(id=4)
Row(id=2)
Row(id=1)
Row(id=0)

  • All function calls need parentheses in Python

Yep, this is one of those frustrating gifts that just keeps on giving [pain].

Scala:
scala> df.groupBy("element").count.collect.foreach(println)
[bar,1]
[qux,1]
[foo,1]
[baz,1]

Python
Before:
In [15]: df.groupBy("element").count.collect.foreach(print)
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
in <module>()
----> 1 df.groupBy("element").count.collect.foreach(print)

AttributeError: 'function' object has no attribute 'collect'

After:
In [17]: df = spark.createDataFrame([(1,"foo"), (2, "bar"), (3, "baz"), (4, "qux")]).toDF("time", "element")
In [18]: df.groupBy("element").count().foreach(print)
Row(element='bar', count=1)
Row(element='qux', count=1)
Row(element='foo', count=1)
Row(element='baz', count=1)

  • Quotes!

Python allows both single (') quotes and double (") quotes for strings. Scala uses the single quote to denote more specific types.

Scala
scala> 'f
res7: Symbol = 'f

scala> 'f'
res6: Char = f

scala> 'foo'
<console>:1: error: unclosed character literal
'foo'

scala> "foo" == 'foo'
<console>:1: error: unclosed character literal
"foo" == 'foo'

Python

In [19]: "foo" == 'foo'
Out[19]: True

Real Time Big Data analytics: Parquet (and Spark) + bonus

Apache Spark and Parquet (SParquet) are a match made in scalable data analytics and delivery heaven. Spark brings a wide ranging, powerful computing platform to the equation while Parquet offers a data format that is purpose-built for high-speed big data analytics. If this sounds like fluffy marketing talk, resist the temptation to close this tab, because what follows are substantial insights I’ve personally procured and am sharing here to help others get the most out of Parquet and Spark.

What is Parquet?

Parquet is a binary compressed columnar file format available to any project in the Hadoop ecosystem (and others outside it even). It’s a mouthful, but let’s break it down.

Binary means parquet files cannot be opened by typical text editors natively (sublime text*, vim, etc).

* My former colleague James Yu wrote a Sublime Text plugin you can find here to view parquet files.

Columnar means the data is stored as columns instead of rows, which is how most traditional databases (MySQL, PostgreSQL, etc) and file formats (CSV, JSON, etc) store their data. This is going to be very important.

Compressed means the file footprint on disk (HDFS, S3, or local filesystem) is smaller than a typical raw uncompressed file. Parquet handles compression differently than traditional compression of a CSV file for example, but in a similar vein to Avro.

Now that the basic definition is out of the way, let’s get right to it.

How can Parquet help me?

Parquet is exceptionally good at high-speed big data analytics. It can store vast quantities of data and operate on it more quickly than many other solutions. Let’s say you have CSV files in S3 (20 TB and growing by 250GiB per day) and a use case that necessitates reporting on those files in a dashboard. A common approach to this problem is aggregating the CSV files down to a MySQL-friendly size, so that reports can be built on this aggregated data. However, this is limited in multiple ways:

  1. CSV is slow to parse because it requires reading all of the rows in the entire file, parsing each line’s columns.
  2. MySQL can only handle so much data, especially high dimensionality data where your users may want to pivot on many different attributes. Every pivot requirement is likely to be impossible to meet, so users must have their functionality restricted for the sake of tech limitations.
  3. Building many tables to support the various pivot requirements becomes onerous, because each table (and the database itself) has to be limited in both size and scope. This increases database storage costs and complexity.

If those limitations had you cringing, I’ve made my case well :). There is an alternative that utilizes SParquet…

  1. Process the CSV files into Parquet files (snappy or gzip compressed), as sketched below
  2. Use Spark with those Parquet files to drive a powerful and scalable analytics solution
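A minimal conversion sketch (hypothetical S3 paths; the input is the TLC CSV described below):

// One-off conversion job: read the CSV once, then write it back out as compressed Parquet
val trips = spark.read
  .option("header", "true")
  .option("inferSchema", "true") // acceptable for a one-off job; supply an explicit schema to skip the extra pass
  .csv("s3a://raw-bucket/green_tripdata_2016-12.csv")

trips.write
  .option("compression", "snappy") // or "gzip" for a smaller footprint on disk
  .parquet("s3a://analytics-bucket/green_tlc/2016/12/")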

CSV File for Proof of Concept (PoC): NYC TLC Green Taxi for December 2016

The CSV file has 1,224,160 rows and 19 columns, coming in at 107MB uncompressed. Here’s the file schema (using header and inferSchema options in Spark 2.1.1):

 |-- VendorID: integer (nullable = true)
 |-- lpep_pickup_datetime: timestamp (nullable = true)
 |-- lpep_dropoff_datetime: timestamp (nullable = true)
 |-- store_and_fwd_flag: string (nullable = true)
 |-- RatecodeID: integer (nullable = true)
 |-- PULocationID: integer (nullable = true)
 |-- DOLocationID: integer (nullable = true)
 |-- passenger_count: integer (nullable = true)
 |-- trip_distance: double (nullable = true)
 |-- fare_amount: double (nullable = true)
 |-- extra: double (nullable = true)
 |-- mta_tax: double (nullable = true)
 |-- tip_amount: double (nullable = true)
 |-- tolls_amount: double (nullable = true)
 |-- ehail_fee: string (nullable = true)
 |-- improvement_surcharge: double (nullable = true)
 |-- total_amount: double (nullable = true)
 |-- payment_type: integer (nullable = true)
 |-- trip_type: integer (nullable = true)

Uncompressed CSV of 107MB was reduced to 24MB (Snappy Parquet) and 19MB (GZIP Parquet). But the real power comes in once the data (now in parquet format) is accessed. Parquet is exceptionally fast when accessing specific columns, which is the opposite of row-based file formats, which thrive when accessing an entire row record. Here are simple SQL examples to show the differences:

--#1
--CSV will read the entire file row-by-row
--Parquet will dump the rows based on their column values
--Winner: Parquet (minimal; because of no parsing)
SELECT *
FROM green_tlc

--#2
--CSV will read the entire file row-by-row,
--filter the PULocationID column to rows matching 226,
--and output all rows/columns that match the filter criteria as the results
--Winner: Parquet (minimal; because of no parsing and push down filtering)
SELECT *
FROM green_tlc
WHERE PULocationID = 226

--#3
--Parquet will first find only the relevant "data blocks" based on the filter criteria
--and only aggregate the rows/columns that match the filter criteria
--Winner: Parquet (huge; because of no parsing and only specific columns)
SELECT PULocationID, SUM(total_amount)
FROM green_tlc
WHERE PULocationID IN (77, 102, 107, 226)
GROUP BY PULocationID

#3 above is a great example of where Parquet shines, because you’re using pushdown filtering, operating on only specific columns (the rest are ignored), and do not have to parse what you don’t care about (all the other columns/rows).
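Here is roughly what query #3 looks like through the DataFrame API (a sketch; the Parquet path is hypothetical). The filter and the two-column projection get pushed down to the Parquet reader, which you can confirm in the PushedFilters/ReadSchema lines of the physical plan:

import org.apache.spark.sql.functions.{col, sum}

val trips = spark.read.parquet("s3a://analytics-bucket/green_tlc/2016/12/")

val byPickup = trips
  .filter(col("PULocationID").isin(77, 102, 107, 226))
  .groupBy("PULocationID")
  .agg(sum("total_amount"))

byPickup.explain() // the physical plan should show the pushed-down In filter and a two-column read schema
byPickup.show()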

What implementation strategies can I use?

Some ideas:

  1. Spark with Parquet (SParquet) on Livy to be able to cache entire datasets/queries
  2. Bonus: Impala with Parquet-backed Hive tables (also Spark compatible) to get hyperfast results available via SQL queries

By now, you have hopefully learned that Parquet is a powerful data format that facilitates big data analytics at a scale far greater than many traditional limited approaches. Go forth and play with Parquet!

Here’s my blog post for specific optimization tips: http://garrens.com/blog/2017/04/08/getting-started-and-tips-for-using-apache-parquet-with-apache-spark-2-x/

Garren Staubli is a Big Data Engineer Consultant at Blueprint Consulting Services, and formerly a big data engineer at iSpot.TV.

 

Connecting Apache Spark to External Data sources (e.g. Redshift, S3, MySQL)

Pre-requisites

AWS S3

Hadoop AWS Jar

AWS Java SDK Jar

* Note: These AWS jars should not be necessary if you’re using Amazon EMR.

Amazon Redshift

JDBC Driver

Spark-Redshift package *

* The Spark-redshift package provided by Databricks is critical particularly if you wish to WRITE to Redshift, because it does bulk file operations instead of individual insert statements. If you’re only looking to READ from Redshift, this package may not be quite as helpful.

MySQL

MySQL JDBC Connector jar

Setting your password [relatively securely]

This is not extremely secure, but is much better than putting your password directly into code.

Use a properties file:

<code>echo "spark.jdbc.password=test_pass_prop" &gt; secret_credentials.properties
spark-submit --properties-file secret_credentials.properties</code>

Examples (in Scala unless otherwise noted)

S3 (using S3A)

<code class="bash plain">spark-shell --jars hadoop-aws-2.7.3.jar,aws-java-sdk-1.7.4.jar</code>
<code>spark.conf.set("fs.s3a.access.key", "&lt;ACCESS_KEY&gt;")
spark.conf.set("fs.s3a.secret.key", "&lt;SECRET_KEY&gt;")
val d = spark.read.parquet("s3a://parquet-lab/files")
d.select("device_id").distinct().count() // =&gt; 1337</code>

* On Amazon EMR, you may be able to skip the jars and key settings.
** You may also want to try the "s3" or "s3n" protocols if s3a doesn't work.

MySQL

spark-shell --jars mysql-connector-java-5.1.40-bin.jar

val properties = new java.util.Properties()
properties.put("driver", "com.mysql.jdbc.Driver")
properties.put("url", "jdbc:mysql://mysql-host:3306")
properties.put("user", "<username>")
properties.put("password", spark.conf.get("spark.jdbc.password", "<default_pass>"))
// This will form a SQL query like "SELECT model_id, prediction, actual_value FROM ml_models WHERE utc_start_time BETWEEN '2017-03-31' AND '2017-04-02'"
// Using .limit(INT) will NOT work as you might expect - it will retrieve all the data first THEN limit when showing you
val models = spark.read.jdbc(properties.get("url").toString, "ml_models", Array("utc_start_time BETWEEN '2017-03-31' AND '2017-04-02'"), properties).select("model_id", "prediction", "actual_value")

Redshift

Recommended approach using Databricks’ spark-redshift:

spark-shell --packages com.databricks:spark-redshift_2.11:3.0.0-preview1 --jars RedshiftJDBC42-1.2.1.1001.jar

Basic JDBC connection only:

spark-shell --jars RedshiftJDBC42-1.2.1.1001.jar
val properties = new java.util.Properties()
properties.put("driver", "com.amazon.redshift.jdbc42.Driver")
properties.put("url", "jdbc:redshift://redshift-host:5439/")
properties.put("user", "<username>")
properties.put("password", spark.conf.get("spark.jdbc.password", "<default_pass>"))
val d_rs = spark.read.jdbc(properties.get("url").toString, "data_table", properties)

Using the Databricks Redshift data source package – for Bulk Data WRITING to Redshift, use this package:

Reading from and writing to Redshift stages data [and doesn’t clean up after itself] in S3, so use object lifecycle management!

val devices = spark.read.format("com.databricks.spark.redshift").
  option("forward_spark_s3_credentials", "true").
  option("url", "jdbc:redshift://redshift-host:5439/?user=<user>&password=<password>").
  option("query", "SELECT * FROM devices").
  option("tempdir", "s3://temporary-holding-bucket/").load()

Writing the dataframe to Redshift in the “public.temporary_devices” table:

devices_transformed.coalesce(64).write
  .format("com.databricks.spark.redshift")
  .option("forward_spark_s3_credentials", "true")
  .option("url", "jdbc:redshift://redshift-host:5439/?user=<user>&password=<password>")
  .option("dbtable", "public.temporary_devices")
  .option("tempdir", "s3a://temporary-holding-bucket/")
  .option("tempformat", "CSV GZIP") // EXPERIMENTAL, but CSV is higher performance than AVRO for loading into redshift
  .mode("error")
  .save()

* Note: coalesce(64) is called to reduce the number of output files to the s3 staging directory, because renaming files from their temporary location in S3 can be slow. This S3Committer should help alleviate that issue.

Resources

http://deploymentzone.com/2015/12/20/s3a-on-spark-on-aws-ec2/

Tips for using Apache Parquet with Spark 2.x

What is Apache Parquet?
It is a compressible binary columnar data format used in the Hadoop ecosystem. We'll talk about it primarily in relation to the Hadoop Distributed File System (HDFS) and Spark 2.x.

What role does it fill?
It is a fast and efficient data format great for scalable big data analytics.

Optimization Tips

  • Aim for around 1GB parquet output files, but experiment with other sizes for your use case and cluster setup (source)
  • Ideally store on HDFS in file sizes of at least the HDFS block size (default 128MB)
  • Storing Parquet files on S3 is also possible (side note: use amazon athena, which charges based on data read if you want Presto SQL-like queries on demand at low cost)
  • Use snappy compression if storage space is not a concern due to it being splittable, but for what should be a relatively small performance hit but much better compression, use gzip (source); setting the codec is sketched below
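Picking the codec is a one-line option on the writer. A sketch, assuming a hypothetical DataFrame df and output path:

// "snappy" is the Spark 2.x default for Parquet; "gzip" trades some CPU for better compression
df.write
  .option("compression", "gzip")
  .parquet("s3a://analytics-bucket/events-parquet/")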

Runtime Stats for Functions | Python Decorator

In a similar vein to my prior Python decorator metadata for functions ("meta_func" => github | PyPi | blog), this decorator is intended to help illuminate the number of calls to a function and aggregate timing statistics for those calls.

It will keep track of each function by its uniquely assigned Python object identifier, the total number of function calls, the total time taken for all calls to that function, and the min, max and average time per call.

Sample usage:

from time import sleep

@runtime_stats()
def self_mult(n):
    sleep(0.2)
    return n * n

print(self_mult(10)) # => 100
print(self_mult(7)) # => 49
print(self_mult.get_func_runtime_stats()) # => {'total_time': 401.668, 'avg': 200.834, 'func_uid': 4302206808, 'func_name': 'self_mult', 'min': 200.445, 'max': 201.223, 'total_calls': 2}

Replace CTRL-A in a file while in a screen session

echo -e "\u0001" | cat -v
# ^A

cat -v 000001 | tr '^A' '\t' | head

Inspiration: http://stackoverflow.com/questions/31460818/creating-a-ctrl-a-delimiter-file

Note: Within the same day, this strategy both worked and then failed. YMMV.

More reliable would be to get into a non-screen session and do "ctrl-v then a".

Split file by keys

Files sometimes come in (whether via Hadoop or other processes) as big globs of data with inter-related parts. Many times I want to process these globs concurrently, but I can see my dilemma unfolding quickly: I could a) write the code to process it serially and be done in 1 hour, or b) write code to process it concurrently and be done in 1.5 hours, because the added overhead of verifying the output, thread safety, etc exceeds the serial processing time. This made me sad, because concurrent processes are awesome. But self-managed, thread-safe concurrent processes are even more awesome!

I thought: what if I could split an input file on keys and group those similarly keyed lines into separate files for processing? Aha!

So I naturally first tried finding existing solutions, and to be honest, awk has a pretty killer one-liner as noted here on Stack Overflow:

awk '{ print >> $5 }' yourfile

This one-liner is likely great for many folks (especially when using only small files). But for me, awk threw an error due to having "too many open files". Again, sad face :(.

So… I wrote my own Python command line utility to take an input file and split it into any number of output files by unique keys in the file. All of your input is maintained, just segregated into different files by the keys you provide.

While my naming conventions may be lacking panache, they are at least clearly intentioned utilities. (But seriously, if you have a better name, I'm all ears.)

Without further ado, I give you
split_file_by_key