Similar Tracks
[SPCL_Bcast #50] Hardware-aware Algorithms for Language Modeling
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Petar Veličković: Graph Deep Learning: Monoids and time, Embracing asynchrony in (G)NNs
Machine Learning and Dynamical Systems Seminar
[SPCL_Bcast] Measurement and Analysis of Application Performance on Exascale GPU-accelerated Systems
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
From Large Language Models to Reasoning Language Models - Three Eras in The Age of Computation.
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
All models are wrong, some are useful: Model Selection with Limited Labels
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
[SPCL_Bcast] Data Selection - Data Challenges when Training Generative Models
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
[SPCL_Bcast] Merging and MoErging for compositional generalization
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
[SPCL_Bcast #51] Neural Network Quantization with Brevitas
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
[SPCL_Bcast #53] The evolution of accelerator-centric GPU services - past, present, future
Scalable Parallel Computing Lab, SPCL @ ETH Zurich