Data Science

The Nobel Prize in Physics 2017

I felt so many emotions today. I want to write them down on my site so that I don't forget them. So that I don't forget how much ...
Read More

LIGO Virgo 2017 Collaboration meeting at CERN

Machine Learning Algorithms f2f: I had a great time chairing the Machine Learning face-to-face meeting at the LIGO-Virgo collaboration meeting at ...
Read More

Alternating Least Squares (ALS) Spark ML

Alternating Least Squares (ALS) for the Santander Kaggle competition. The Kaggle Santander competition just concluded. For this competition I decided to ...
Read More

Install JVM xgboost package

Install the JVM xgboost package to interface with Apache Spark. For a complete guide and documentation, please refer to the official ...
Read More

Kaggle Bosch competition using Apache Spark

Apache Spark for Kaggle competitions. I competed in the Kaggle Bosch competition to predict failures on production lines. As ...
Read More

Build and install Spark on a Linux platform

Build and install Spark on a Linux platform. Here is a short guide to building, installing and configuring Apache Spark on ...
Read More

Scala project under IntelliJ IDEA

Setting up a Scala project using IntelliJ IDEA. I assume you have already downloaded and installed the Community Edition of ...
Read More

Spark code snippets

A growing post that gathers short pieces of code. Let’s suppose spark is an open Spark session. # Open ...
Read More

Spark and XGBoost using Scala

Spark and XGBoost using the Scala language. Recently the XGBoost project released a package on GitHub that includes an interface to ...
Read More

PySpark for RedHat Kaggle competition

Using PySpark for the RedHat Kaggle competition. The RedHat Kaggle competition is not so prohibitive from a computational point of view or ...
Read More