Skip to content
Data never lie Data never lie

  • About me
  • Research and Academia
    • Conferences, Workshops, Seminar and Schools
    • Academic Publications
    • Curriculum Vitae
    • Academic webpage
    • Certificates
    • Public activity
  • Data Science
    • Kaggle Competitions
    • Signal Processing for Dummies
    • Spark pills
  • Travels and photos
    • The travelling Scientist
    • PhotoGallery
    • The Viewfinder
  • Posts
  • About me
  • Research and Academia
    • Conferences, Workshops, Seminar and Schools
    • Academic Publications
    • Curriculum Vitae
    • Academic webpage
    • Certificates
    • Public activity
  • Data Science
    • Kaggle Competitions
    • Signal Processing for Dummies
    • Spark pills
  • Travels and photos
    • The travelling Scientist
    • PhotoGallery
    • The Viewfinder
  • Posts
  • github
  • linkedin
  • instagram
  • facebook
  • Bluesky
  • Home
  • Data Science
  • kaggle
Alternating Least Squares (ALS) Spark ML
Posted inData Science kaggle

Alternating Least Squares (ALS) Spark ML

Alternating Least Squares (ALS) for Santander Kaggle competition The Kaggle Santander competition just concluded. I decided for this competition to continue my learning process of spark environment and invest time…
Read More
Kaggle  Bosch competition using Apache Spark
Posted inData Science kaggle

Kaggle Bosch competition using Apache Spark

Apache Spark for Kaggle competitions I competed in Kaggle Bosch competition to predict the failures during the production lines.  As described in another post, I decided to approach this competition…
Read More
scala-xgboost-spark
Posted inCode Data Science kaggle

Spark and XGBoost using Scala

Spark and XGBoost using Scala language Recently XGBoost project released a package on github where it is included interface to scala, java and spark (more info at this link). I…
Read More
PySpark for RedHat Kaggle competition
Posted inData Science kaggle

PySpark for RedHat Kaggle competition

Using PySpark for RedHat Kaggle competition Redhat Kaggle competition is not so prohibitive from a computational point of view or data management. The use of Pandas and xgboost, R allows…
Read More
PySpark first approaches. Tips and tricks
Posted inData Science kaggle

PySpark first approaches. Tips and tricks

First approaches to Apache Spark and PySpark. By participating in the recent competition Kaggle Bosch production line performance, I decided to try using Apache Spark and in particular PySpark. When…
Read More
Basic Insight by Zygmunt Zajac
Posted inData Science kaggle

Basic Insight by Zygmunt Zajac

  Our team - Paul Perry, Elena Cuoco* and Zygmunt Zając - did not discover the leak. We didn’t attain a winning score, but on the other hand, the features…
Read More

Posts pagination

1 2 Next page
Elena Cuoco

I'm a scientist who loves data, analyzing it, and extracting information. I work in the field of gravitational wave research and the application of artificial intelligence techniques.

BlueSky Latest Posts
Piacere, ET – Ep. 6 | Elena Cuoco
Copyright 2026 — Data never lie. All rights reserved. Bloglo WordPress Theme
Scroll to Top