=================================== A Soccer Action Valuation Toolkit =================================== `socceraction` is a Python package for objectively quantifying the value of the individual actions performed by soccer players using event stream data. It contains the following components: - A set of API clients for **loading event stream data** from `StatsBomb `__, `Wyscout `__ and `Opta `__. - Converters for each of these provider's proprietary data format to the **SPADL** and **atomic-SPADL** formats, which are unified and expressive languages for on-the-ball player actions. - An implementation of the **Expected Threat (xT)** possession value framework. - An implementation of the **VAEP** and **Atomic-VAEP** possession value frameworks. .. image:: actions_bra-bel.png :width: 600 :align: center .. toctree:: :hidden: :caption: Documentation documentation/intro documentation/install documentation/data/index documentation/spadl/index documentation/valuing_actions/index documentation/faq .. toctree:: :hidden: :caption: API reference api/data api/spadl_regular api/xthreat api/vaep api/spadl_atomic api/atomic_vaep .. toctree:: :hidden: :caption: Development Changelog development/developer_guide First steps =========== Are you new to soccer event stream data and possession value frameworks? Check out our `interactive explainer `__ and watch Lotte Bransen's and Jan Van Haaren's `presentation in Friends of Tracking `__. Once familiar with the basic concepts, you can move on to the :doc:`quickstart guide ` or continue with the hands-on video tutorials of the Friends of Tracking series: - Valuing actions in soccer (`video `__, `slides `__) This presentation expands on the content of the introductory presentation by discussing the technicalities behind the VAEP framework for valuing actions of soccer players as well as the content of the hands-on video tutorials in more depth. - Tutorial 1: Run pipeline (`video `__, `notebook `__, `notebook on Google Colab `__) This tutorial demonstrates the entire pipeline of ingesting the raw Wyscout match event data to producing ratings for soccer players. This tutorial touches upon the following four topics: downloading and preprocessing the data, valuing game states, valuing actions and rating players. - Tutorial 2: Generate features (`video `__, `notebook `__, `notebook on Google Colab `__) This tutorial demonstrates the process of generating features and labels. This tutorial touches upon the following three topics: exploring the data in the SPADL representation, constructing features to represent actions and constructing features to represent game states. - Tutorial 3: Learn models (`video `__, `notebook `__, `notebook on Google Colab `__) This tutorial demonstrates the process of splitting the dataset into a training set and a test set, learning baseline models using conservative hyperparameters for the learning algorithm, optimizing the hyperparameters for the learning algorithm and learning the final models. - Tutorial 4: Analyze models and results (`video `__, `notebook `__, `notebook on Google Colab `__) This tutorial demonstrates the process of analyzing the importance of the features that are included in the trained machine learning models, analyzing the predictions for specific game states, and analyzing the resulting player ratings. .. note:: The video tutorials are based on version 0.2.0 of the socceraction library. If a more recent version of the library is installed, the code may need to be adapted. Getting help ============ Having trouble? We'd like to help! * Try the :doc:`FAQ ` -- it's got answers to many common questions. * Looking for specific information? Try the :ref:`genindex` or :ref:`modindex`. * Report bugs in our `ticket tracker`_. .. _ticket tracker: https://github.com/ML-KULeuven/socceraction/issues Contributing ============ Learn about the development process itself and about how you can contribute in our :doc:`developer guide `. Research ======== If you make use of this package in your research, please consider citing the following papers. - Tom Decroos, Lotte Bransen, Jan Van Haaren, and Jesse Davis. **"Actions speak louder than goals: Valuing player actions in soccer."** In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1851-1861. 2019. [`pdf`__, `bibtex`__] - Maaike Van Roy, Pieter Robberechts, Tom Decroos, and Jesse Davis. **"Valuing on-the-ball actions in soccer: a critical comparison of xT and VAEP."** In Proceedings of the AAAI-20 Workshop on Artifical Intelligence in Team Sports. AI in Team Sports Organising Committee, 2020. [`pdf`__, `bibtex`__] __ http://doi.acm.org/10.1145/3292500.3330758 __ https://raw.githubusercontent.com/ML-KULeuven/socceraction/master/docs/_static/decroos19.bibtex __ https://limo.libis.be/primo-explore/fulldisplay?docid=LIRIAS2913207&context=L&vid=KULeuven&search_scope=ALL_CONTENT&tab=all_content_tab&lang=en_US __ https://raw.githubusercontent.com/ML-KULeuven/socceraction/master/docs/_static/vanroy20.bibtex