Join a stealth-mode startup building out their AI and cloud platform, powered by thousands of H100s, H200s, and B200s, ready for experimentation, full-scale model training, or inference. As a Platform Engineer/Senior Site Reliability Engineer, you’ll own the reliability, performance, and automation of this GPU-powered infrastructure, ensuring seamless orchestration across environments managed by Slurm, Kubernetes, or direct SSH access. As well as supporting their extremely exciting new products coming to the market!
This is a rare opportunity to work at the intersection of AI infrastructure and AI, shaping the operational backbone of one of the largest GPU clusters in private deployment.
If you want to build and operate infrastructure for frontier AI workloads, automate systems at petascale, and be part of a founding engineering team, this is the place to do it. Get in touch and apply today!
...Winter Park and Granby Colorado. Our diverse portfolio features a wide range of dining experiences, including casual fine dining, vibrant bar and grills, and deli & catering operations. We have one happy problem: our dining rooms are buzzing and there are plenty of tables...
...Dental insurance~Free uniforms~Health insurance~Opportunity for advancement~Paid time off~Training & development~Vision insuranceReports To: General Manager Classification: Full-Time, Non-ExemptSchedule: Monday-Thursday and Saturday. Days off:...
...Radiologist Titan Placement Group invites you to explore an opportunity to join a well-established healthcare facility in Skowhegan, Maine. Our Client in Central Maine is seeking to hire a BC/BE full time general radiologist. This position offers unique onsite and...
Description:, including hands-on processing experience and knowledge of current approaches and standards for describing archives and graphic materials. ~ Knowledge of best practices for handling, organizing, describing, and preserving photographic archives, including...
...Systematic approach to problem-solving and a proven ability to guide others in adopting the SRE mentality. Responsibilities: Design systems for ultra-high reliability and fault tolerance in mission-critical clearing operations under stringent regulatory standards...