Welcome to the Huawei Zurich Tech Arena, where innovation meets challenges in the world of AI and network technology!
This year, we’re excited to present two groundbreaking challenges designed to push the boundaries of model compression and network scalability. Compete with the best minds in tech as you explore solutions that could redefine the future of large-scale AI processing.
With a Docker environment ready to go, all the tools are at your fingertips. Join us at Huawei Zurich Tech Arena to show off your skills, compete for the top spot, and make your mark on the future of AI and network innovation!
Connect
with industry leaders and top innovators from across Europe.
Showcase your expertise
in front of a leading tech company and its decision-makers.
Get hands-on experience
solving challenging problems in the world of AI and network technology!
Gain insights
into the latest technologies and trends shaping the future of AI.
Compete
for a chance at a €28,000 prize pool and potential career opportunities within Huawei.
Network
with like-minded individuals across multiple countries.
WHO CAN PARTICIPATE?
Students
Apply only if you are in a Master's or PhD program.
Students from Switzerland, the UK, and Germany are invited to register! Conquer alone, no teams allowed.
Studying...
Networking & Communication Systems
Computer Science
Computer Engineering
Software Engineering
Data Science
Electrical Engineering
Computer Systems
Information Technology
Systems Engineering
Machine Learning
Artificial Intelligence
Computational Mathematics
CHALLENGES
Challenge 2: Communication-Affined Direct Topology of NPUs
Participants are asked to define a general direct network topology framework that specifies the connections among switch nodes, such that the cluster achieves the maximum network scale (i.e., supports the maximum number of GPUs in the cluster, approximately approaching the Moore bound).
Deliverables:
Describe how a direct network topology can be constructed with switch nodes of radix n, and calculate the maximum scale the network can achieve (a Moore-bound sketch is given after this list).
Model the communication efficiency of the network topology for the AllReduce and AlltoAll primitives; devise AllReduce/AlltoAll algorithms that achieve the ideal/optimal performance (a baseline cost-model sketch also follows below).
Describe the modularity of the network topology (how it can be constructed in practice).
Compare the proposed network topology with classical topologies (e.g., CLOS, Dragonfly, Dragonfly+) regarding their advantages, disadvantages, and application scope.
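As a reference point for the maximum-scale deliverable, here is a minimal sketch of the classic graph-theoretic Moore bound. How the switch radix is split between inter-switch links and GPU-facing ports is an illustrative assumption in the example (radix 64, 16 GPU ports), not something the challenge specifies.

```python
# Minimal sketch (assumption-laden, not part of the official materials):
# the Moore bound limits how many nodes a regular graph of degree d and
# diameter k can contain.

def moore_bound(d: int, k: int) -> int:
    """Maximum number of nodes in a graph with node degree d and diameter k."""
    return 1 + d * sum((d - 1) ** i for i in range(k))

if __name__ == "__main__":
    # Illustrative assumption: radix-64 switches, 16 ports reserved for GPUs,
    # leaving d = 48 ports for inter-switch links, and network diameter 2.
    radix, gpu_ports, diameter = 64, 16, 2
    d = radix - gpu_ports
    switches = moore_bound(d, diameter)
    print(f"Moore bound on switch count: {switches}")
    print(f"Upper bound on attached GPUs: {switches * gpu_ports}")
```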
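For the communication-efficiency deliverable, one common starting point is an alpha-beta (latency-bandwidth) cost model. The sketch below gives the textbook costs of a ring AllReduce and a pairwise-exchange AlltoAll on p endpoints; it is a baseline under simplifying assumptions (uniform links, no congestion), not a model prescribed by the challenge.

```python
# Illustrative alpha-beta cost model (an assumption, not the official model):
# alpha = per-message latency [s], beta = time per byte [s/B],
# p = number of endpoints, m = bytes held per endpoint.

def ring_allreduce_cost(p: int, m: float, alpha: float, beta: float) -> float:
    """Ring AllReduce: 2(p-1) steps (reduce-scatter + allgather), m/p bytes each."""
    return 2 * (p - 1) * alpha + 2 * (p - 1) / p * m * beta

def pairwise_alltoall_cost(p: int, m: float, alpha: float, beta: float) -> float:
    """Pairwise-exchange AlltoAll: p-1 steps, each sending m/p bytes to one peer."""
    return (p - 1) * alpha + (p - 1) / p * m * beta

if __name__ == "__main__":
    # Purely illustrative numbers: 1024 endpoints, 1 GiB per endpoint,
    # 5 us message latency, 400 Gb/s links.
    p, m = 1024, 2**30
    alpha, beta = 5e-6, 8 / 400e9
    print(f"ring AllReduce   : {ring_allreduce_cost(p, m, alpha, beta):.3f} s")
    print(f"pairwise AlltoAll: {pairwise_alltoall_cost(p, m, alpha, beta):.3f} s")
```

A topology-aware model would replace the uniform per-step costs with hop counts and link contention derived from the proposed topology.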
Challenge 1: One-Shot Model Compression
The challenge consists of implementing one-shot compression of the Llama-3.1-B model, with the goal of achieving the highest compression rate at the lowest accuracy degradation with respect to the original model.
To implement one-shot compression, participants may use pruning and weight-only quantization, but may not re-train or fine-tune the compressed model to improve its accuracy (a minimal baseline sketch is given after the deliverables list below).
Participants are required to integrate their solution into the popular lm-evaluation-harness benchmarking framework. A Docker container is provided that contains the model development and evaluation environment.
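The exact integration hooks are defined by the provided Docker container, but with recent versions of lm-evaluation-harness an evaluation run can be driven from Python roughly as in the sketch below; the checkpoint path and task list are placeholders, not the official evaluation setup.

```python
# Hedged sketch: scoring a (compressed) checkpoint with lm-evaluation-harness.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                        # HuggingFace causal-LM backend
    model_args="pretrained=/path/to/compressed-llama,dtype=bfloat16",  # placeholder path
    tasks=["hellaswag", "arc_challenge"],              # placeholder tasks
    batch_size=8,
)
print(results["results"])
```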
Deliverables:
Code that implements the model compression algorithm (estimated size on the order of KBs). This is needed to check that the reduced model is generated programmatically and is not the result of fine-tuning or retraining.
Compressed model (estimated size on the order of GBs), i.e., weights, biases, activations, etc. This is needed to run inference on the platform.
Code to dequantize the model (estimated size on the order of KBs). This is needed, together with the compressed model, to know how to load the model parameters.
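To make these deliverables concrete, here is a minimal, data-free baseline sketch: per-channel round-to-nearest weight-only quantization of the model's linear layers in PyTorch, with the matching dequantization routine. It respects the no-retraining rule but only illustrates the expected artifacts (compression code, packed weights, dequantizer); the 8-bit setting and helper names are assumptions, not a competitive solution.

```python
# Baseline sketch only: round-to-nearest (RTN) weight-only quantization,
# applied one-shot with no retraining or fine-tuning.
import torch

def quantize_weight_rtn(w: torch.Tensor, n_bits: int = 8):
    """Per-output-channel symmetric round-to-nearest quantization."""
    qmax = 2 ** (n_bits - 1) - 1
    scale = w.abs().amax(dim=1, keepdim=True).clamp(min=1e-8) / qmax
    q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax).to(torch.int8)
    return q, scale

def dequantize_weight(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    """Recover an approximate float weight matrix from (q, scale)."""
    return q.to(scale.dtype) * scale

@torch.no_grad()
def compress_linear_layers(model: torch.nn.Module, n_bits: int = 8) -> dict:
    """Quantize every nn.Linear weight; return the packed int8 + scale state."""
    packed = {}
    for name, module in model.named_modules():
        if isinstance(module, torch.nn.Linear):
            q, scale = quantize_weight_rtn(module.weight.data, n_bits)
            packed[name] = {"q": q, "scale": scale}
            # Keep a dequantized copy in the live model so it can be scored
            # directly with lm-evaluation-harness.
            module.weight.data = dequantize_weight(q, scale)
    return packed
```

Saving the returned packed dictionary with torch.save would correspond to the compressed-model deliverable, and dequantize_weight is the companion loading routine; pruning could be layered on top by zeroing selected weights before quantization.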