Subject archive for "spark"

Spark, Dask, and Ray: choosing the right framework

By Nikolay Manchev

Machine Learning

Domino Unlocks the Power of Data Science with Ray 2 Clusters

By Thomas Dinsmore and Yuval Zukerman

Making PySpark work with spaCy: Overcoming serialization errors

Using PySpark and PCA to analyze large neuroimaging datasets

By Sergul Aydore, Ph.D., and Syed Ashrafulla, Ph.D.

Machine Learning

Creating Multi-language Pipelines with Apache Spark or Avoid Having to Rewrite spaCy into Java

By Holden Karau