This challenge is part of an effort to explore the use of machine learning to assist high energy physicists in discovering and characterizing new particles. Particles are the tiny constituents of matter generated in a collision between proton bunches. Physicists at CERN study particles using particle accelerators. The goal of this challenge is to build a machine learning model to read images of particles and identify their type.

1 Kaggle kernels

2 Setting python version and anaconda environment for R

3 Python starter code given by the organizer

cirtaChallenge.ipynb is a starter python notebook. It shows us how to open and view a .pkl file and starts you off with a simple classifier.

4 R code

4.2 Load R packages

FALSE 
FALSE Attaching package: 'dplyr'
FALSE The following objects are masked from 'package:stats':
FALSE 
FALSE     filter, lag
FALSE The following objects are masked from 'package:base':
FALSE 
FALSE     intersect, setdiff, setequal, union
FALSE 
FALSE Attaching package: 'data.table'
FALSE The following objects are masked from 'package:dplyr':
FALSE 
FALSE     between, first, last
FALSE 
FALSE Attaching package: 'EBImage'
FALSE The following object is masked from 'package:data.table':
FALSE 
FALSE     transpose
FALSE 
FALSE Attaching package: 'xgboost'
FALSE The following object is masked from 'package:dplyr':
FALSE 
FALSE     slice

5 Weigths compute

link

6 Preprocessing

Convert labels to numeric values

7 Split train and valid datasets