Magic (magic.dev) Logo

Magic (magic.dev)

HPC Networking Lead

Job Posted 18 Days Ago Posted 18 Days Ago
Be an Early Applicant
Remote
2 Locations
100K Annually
Senior level
Remote
2 Locations
100K Annually
Senior level
The HPC Networking Lead will optimize communication in distributed training by implementing algorithms, designing monitoring systems, and enhancing overall network performance.
The summary above was generated by AI

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role:

As the HPC networking lead, you will be the lead technical contributor to an internal NCCL-like library, aiming to optimize performance for communication patterns our workloads require.

What you might work on: 

  • Implement and tune custom collective communication algorithms for specific topologies

  • Design and implement networking monitoring systems for our training clusters

  • Implement and benchmark collective communication primitives to achieve low latency and high throughput

  • Contribute to the development of debugging and profiling tools for network communication performance analysis

  • Integrate new communication techniques into the overall system architecture

What we’re looking for: 

  • Deep understanding of sharding techniques used in distributed training (pipeline/tensor/data parallelism) 

  • Experience contributing to a collective communication library such as NCCL or an MPI implementation

  • Expert understanding of RoCE/IB RDMA networks and have written distributed algorithms using RDMA

  • A track record of contributing to open-source projects related to high-performance networking

Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience. 

Our culture:

  • Integrity. Words and actions should be aligned

  • Hands-on. At Magic, everyone is building 

  • Teamwork. We move as one team, not N individuals

  • Focus. Safely deploy AGI. Everything else is noise

  • Quality. Magic should feel like magic

Compensation, benefits and perks (US):

  • Annual salary range: $100K - $550K

  • Equity is a significant part of total compensation, in addition to salary

  • 401(k) plan with 6% salary matching

  • Generous health, dental and vision insurance for you and your dependents

  • Unlimited paid time off

  • Visa sponsorship and relocation stipend to bring you to SF, if possible

  • A small, fast-paced, highly focused team

Top Skills

Distributed Algorithms
Ib Rdma
Mpi
Nccl
Networking Monitoring Systems
Roce

Similar Jobs

An Hour Ago
Remote
United States
196K-265K Annually
Senior level
196K-265K Annually
Senior level
Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
The Senior Backend Product Software Engineer at Dropbox will develop and enhance products, mentor juniors, and ensure operational continuity in a fast-paced environment.
Top Skills: GoHTML/CSSJavaJavaScriptMySQLPythonReactRust
2 Hours Ago
Remote
Hybrid
5 Locations
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Principal Compiler Engineer will enhance the V8 compiler for Cloudflare's Workers Runtime, focusing on performance and scalability improvements in a distributed environment.
Top Skills: C++JavaScriptLinuxRustV8Webassembly
2 Hours Ago
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Healthtech • Software
Lead a software engineering team to build impactful healthcare technology, focusing on integrations, data services, and maintaining high standards of software quality and security.
Top Skills: Aws CloudwatchDraw.IoFigmaGithub ActionsJava (Spring)JenkinsLucidchartNoSQLSQL

What you need to know about the Austin Tech Scene

Austin has a diverse and thriving tech ecosystem thanks to home-grown companies like Dell and major campuses for IBM, AMD and Apple. The state’s flagship university, the University of Texas at Austin, is known for its engineering school, and the city is known for its annual South by Southwest tech and media conference. Austin’s tech scene spans many verticals, but it’s particularly known for hardware, including semiconductors, as well as AI, biotechnology and cloud computing. And its food and music scene, low taxes and favorable climate has made the city a destination for tech workers from across the country.

Key Facts About Austin Tech

  • Number of Tech Workers: 180,500; 13.7% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Dell, IBM, AMD, Apple, Alphabet
  • Key Industries: Artificial intelligence, hardware, cloud computing, software, healthtech
  • Funding Landscape: $4.5 billion in VC funding in 2024 (Pitchbook)
  • Notable Investors: Live Oak Ventures, Austin Ventures, Hinge Capital, Gigafund, KdT Ventures, Next Coast Ventures, Silverton Partners
  • Research Centers and Universities: University of Texas, Southwestern University, Texas State University, Center for Complex Quantum Systems, Oden Institute for Computational Engineering and Sciences, Texas Advanced Computing Center
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account