WeSearch

Cedana (YC S23) Is Hiring

·3 min read · 0 reactions · 0 comments · 13 views
#technology#engineering#ai#hpc#careers
Cedana (YC S23) Is Hiring
⚡ TL;DR · AI summary

Cedana is addressing the challenges of AI and HPC infrastructure by enhancing cluster utilization and reliability through automated GPU checkpointing. The company is seeking a Forward Deployed Engineer to lead customer integrations and optimize platform performance. The role requires extensive experience with SLURM deployments and strong Linux fundamentals.

Key facts
Original article
Y Combinator
Read full at Y Combinator →
Opening excerpt (first ~120 words) tap to expand

Introducing Cedana The Problem AI and HPC infrastructure suffers from scarcity and high costs, so when failures happen they are costly in terms of time and money. Cluster productivity directly determines research output and revenue. Achieving high utilization and throughput is increasingly challenging due to the complexity of workloads, hardware, and operations. Cedana’s Solution Cedana maximizes AI+HPC cluster utilization and reliability with automated GPU checkpointing infrastructure. We enable transparent and fast migration of GPU workloads across instances, without losing work. Workloads automatically migrate to achieve new levels of reliability and throughput while accelerating time to results.

Excerpt limited to ~120 words for fair-use compliance. The full article is at Y Combinator.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from Y Combinator