diff --git a/_gsocorgs/2025/bnl.md b/_gsocorgs/2025/bnl.md new file mode 100644 index 000000000..f70444ccc --- /dev/null +++ b/_gsocorgs/2025/bnl.md @@ -0,0 +1,14 @@ +--- +title: "Brookhaven National Laboratory" +author: "Michel Hernandez Villanueva" +layout: default +organization: BNL +logo: BNL-logo.png +description: | + Brookhaven National Laboratory (BNL) is a multipurpose research laboratory located in Upton, New York. + It is operated by Brookhaven Science Associates for the U.S. Department of Energy. + It hosts the Relativistic Heavy Ion Collider, the future Electron-Ion Collider and the National Synchrotron Light Source II. + BNL scientists are part of major HEP experiments, such as ATLAS, Belle II, and DUNE. +--- + +{% include gsoc_proposal.ext %} \ No newline at end of file diff --git a/_gsocprojects/2025/project_HSFCondDB.md b/_gsocprojects/2025/project_HSFCondDB.md new file mode 100644 index 000000000..63fd3089b --- /dev/null +++ b/_gsocprojects/2025/project_HSFCondDB.md @@ -0,0 +1,24 @@ +--- +project: HSFCondDB +title: HSF Conditions Database +layout: default +description: | + In high-energy physics (HEP), conditions databases play a critical role in managing non-event data. + This includes calibration constants, alignment parameters, and detector conditions, which evolve over time. + These databases ensure that analysis software can access the correct calibration and alignment data corresponding to + the detector’s state at any given time, enabling accurate physics measurements. + + The [HEP Software Foundation](https://hepsoftwarefoundation.org/) (HSF) proposes a Conditions Database reference + for HEP and Nuclear Physics experiments around the world. Several experts have converged on a common design for + conditions data access management [arXiv:1901.05429](https://arxiv.org/abs/1901.05429). + The [nopayloaddb](https://github.com/BNLNPPS/nopayloaddb) is an implementation of this reference. It has been + successfully operating within the [sPHENIX](https://www.sphenix.bnl.gov/) experiment for nearly two years + and is currently being adopted by [Belle II](https://www.belle2.org/). Additionally, other collaborations, including + [ePIC](https://www.bnl.gov/eic/epic.php) and the [Einstein Telescope](https://www.et-gw.eu/), have expressed interest + in evaluating its suitability for their needs. +summary: | + The [Nopayloaddb](https://github.com/BNLNPPS/nopayloaddb) is an implementation of the HSF + Conditions Database reference, an experiment-agnostic design for conditions data access management. +--- + +{% include gsoc_project.ext %} diff --git a/_gsocproposals/2025/proposal_HSFCondDB_AILogAnalysis.md b/_gsocproposals/2025/proposal_HSFCondDB_AILogAnalysis.md new file mode 100644 index 000000000..ab1982608 --- /dev/null +++ b/_gsocproposals/2025/proposal_HSFCondDB_AILogAnalysis.md @@ -0,0 +1,59 @@ +--- +title: Intelligent Log Analysis for the HSF Conditions Database +layout: gsoc_proposal +project: HSFCondDB +year: 2025 +difficulty: medium +duration: 350 +mentor_avail: June-October +organization: + - BNL +--- + +## Description + +The [nopayloaddb](https://github.com/BNLNPPS/nopayloaddb) project works as an implementation of the Conditions Database +reference for the HSF. It provides a RESTful API for managing payloads, global tags, payload types, and associated data. + +Our current system, composed of Nginx, Django, and database ([link to helm chart](https://github.com/BNLNPPS/nopayloaddb-charts)), +lacks a centralized logging solution making it difficult to effectively monitor and troubleshoot issues. +This task will address this deficiency by implementing a centralized logging system aggregating logs from multiple +components, and develop a machine learning model to perform intelligent log analysis. The model will identify unusual +log entries indicative of software bugs, database bottlenecks, or other performance issues, allowing us to address +problems before they escalate. Additionally, by analyzing system metrics, the model will provide insights for an optimal +adjustment of parameters during periods of increased request rates. + +## Steps + +1. Set up a centralized logging system +2. Collect and structure logs from Nginx, Django, and the database +3. Develop an ML model for log grouping and anomaly detection +4. Implement Kubernetes-based database with replication +5. Train an ML model to optimize Kubernetes parameters dynamically + + +## Expected Results + +* A centralized logging system for improved monitoring and troubleshooting +* ML-powered anomaly detection +* ML-driven dynamic configuration for optimal performance + +## Requirements + +* Python and basic understanding of ML frameworks +* Kubernetes, basic understanding, k8s, Helm, Operators, OpenShift +* Django and Nginx, basic understanding of web frameworks and logging +* Database knowledge, PostgreSQL, database replication + + +## Mentors + +- **Ruslan Mashinistov [mashinistov@bnl.gov](mailto:mashinistov@bnl.gov) BNL** +- John S. De Stefano Jr. [jd@bnl.gov](mailto:jd@bnl.gov) BNL +- Michel Hernandez Villanueva [mhernande1@bnl.gov](mailto:mhernande1@bnl.gov) BNL + + +## Links + +* Django REST API: https://github.com/BNLNPPS/nopayloaddb +* Automized deployment with helm-chart: https://github.com/BNLNPPS/nopayloaddb-charts diff --git a/gsoc/2025/mentors.md b/gsoc/2025/mentors.md index b4ad87139..c13a06fcd 100644 --- a/gsoc/2025/mentors.md +++ b/gsoc/2025/mentors.md @@ -20,15 +20,18 @@ layout: plain * David Lange [david.lange@cern.ch](mailto:david.lange@cern.ch) CompRes * Serguei Linev [S.Linev@gsi.de](mailto:S.Linev@gsi.de) GSI * Johan Mabille [johan.mabille@quantstack.net](mailto:johan.mabille@quantstack.net) QuantStack +* Ruslan Mashinistov [mashinistov@bnl.gov](mailto:mashinistov@bnl.gov) BNL * Peter McKeown [peter.mckeown@cern.ch](mailto:peter.mckeown@cern.ch) CERN * Felice Pantaleo [felice.pantaleo@cern.ch](mailto:felice.pantaleo@cern.ch) CERN * Giacomo Parolini [giacomo.parolini@cern.ch](mailto:giacomo.parolini@cern.ch) CERN * Alexander Penev [alexander.p.penev@gmail.com](mailto:alexander.p.penev@gmail.com) CompRes/University of Plovdiv, BG * Mayank Sharma [mayank.sharma@cern.ch](mailto:mayank.sharma@cern.ch) UMich * Simon Spannagel [simon.spannagel@desy.de](mailto:simon.spannagel@desy.de) DESY +* John De Stefano [jd@bnl.gov](mailto:jd@bnl.gov) BNL * Graeme Stewart [graeme.andrew.stewart@cern.ch](mailto:graeme.andrew.stewart@cern.ch) CERN * Maciej Szymański [maciej.szymanski@cern.ch](mailto:maciej.szymanski@cern.ch) ANL * Peter Van Gemmeren [peter.van.gemmeren@cern.ch](mailto:peter.van.gemmeren@cern.ch) ANL * Martin Vasilev [mvassilev@uni-plovdiv.bg](mailto:mvassilev@uni-plovdiv.bg) University of Plovdiv, BG * Vassil Vassilev [vvasilev@cern.ch](mailto:vvasilev@cern.ch) CompRes +* Michel Hernandez Villanueva [mhernande1@bnl.gov](mailto:mhernande1@bnl.gov) BNL * Valentin Volkl [valentin.volkl@cern.ch](mailto:valentin.volkl@cern.ch) CERN diff --git a/images/BNL-logo.png b/images/BNL-logo.png index df3920684..6d7c9af0c 100644 Binary files a/images/BNL-logo.png and b/images/BNL-logo.png differ