Back to Jobs

Backend Engineer - AI Runtime

Remote, USA Full-time Posted 2026-06-17

About Us

We are a stealth-mode startup building the new AI runtime. Our mission is to make advanced language models deployable, customizable, and secure across diverse environments.

Role

We are seeking a Backend Engineer (Node.js/NestJS) to extend our platform using our existing codebase. You'll build the proxy backend that interacts with our custom inference runtime and extend dashboards.

This role requires strong backend engineering skills, an ability to integrate existing systems, and comfort working closely with C++ engineers who are building low-level runtime features using CUDA.

Responsibilities

Proxy Backend for Inference Runtime

  • Build and maintain a Node.js-based proxy backend that:
    • Accepts inference requests from the frontend.

    • Schedules and serializes prompts.

    • Manages QKV cache load/unload (API hooks from the C++ runtime).
    • Provides APIs to manage LoRA adapters.

  • Integrate with authentication, RBAC, and logging already provided by the existing stack.
  • Expose metrics and logs for monitoring inference usage and performance.

Dashboards

  • Extend existing Dashboard: Dataset upload, training job view, model management, inference usage, request history, and adapter selection.
  • Reuse auth, billing, and user management code (Auth0, Stripe).

  • Add necessary backend endpoints to support new UI flows.

Core Stack & Infrastructure

  • Develop using NestJS as the main backend framework.
  • Work with PostgreSQL, Redis, MongoDB, and HashiCorp Vault for persistence, caching, and secrets.
  • Use Socket.IO for real-time updates (job status, inference progress).
  • Ensure secure integration with Stripe (billing) and Auth0 (identity).
  • Collaborate with DevOps on deployment pipelines.

Requirements

  • Deep knowledge of the JavaScript and TypeScript languages.

  • Strong experience with Node.js and NestJS framework.

  • Proficiency in PostgreSQL and Redis for persistence and caching.

  • Hands-on experience with Socket.IO or other WebSocket libraries.

  • Experience with secure configuration and secrets management (HashiCorp Vault preferred).
  • Experience with JWKS.
  • Comfortable working with microservices and integrating with existing codebases.
  • Strong debugging and systems thinking,  able to reason about scheduling, state management, and concurrency.

Nice to Have

  • Experience integrating with AI runtimes (gRPC/REST backends for inference).
  • Experience with RAG and MCP.
  • Experience with authentication/authorization frameworks (Auth0, JWT, RBAC).
  • Familiarity with Stripe API or similar billing systems.

  • Contributions to backend open-source projects.

  • Experience with WebRTC.

Why Join

  • Extend a proven SaaS foundation into a new AI runtime platform.

  • Work directly with a C++ systems team building custom inference features.
  • Build real products (dashboards + runtime APIs) used by vendors and customers.
  • Competitive compensation, equity potential.

Please use this link to apply to this job:  https://www.baasi.com/career/apply/3164212

Apply To This Job

Similar Jobs

Senior C++ Programmer - Slicer Software Maintenance & 3D Development

Remote, USA Full-time

Paralegal Asbestos Affidavit Preparer ( Asbestos Law Firm Experience Required)

Remote, USA Full-time

Legal Intake Department

Remote, USA Full-time

Asbestos Paralegal - Medical Records Department (Asbestos Experience Requred)

Remote, USA Full-time

Sr React.js Engineer - 3 Months Contract

Remote, USA Full-time

Microsoft 365 Security and Compliance Specialist

Remote, USA Full-time

Marketing Manager

Remote, USA Full-time

Quality Assurance Engineer

Remote, USA Full-time

Commercial HVAC Technician

Remote, USA Full-time

Commercial HVAC Technician

Remote, USA Full-time

Experienced Tele Sales Remote Customer Support Specialist (Entry Level / Part Time) – Deliver Exceptional Customer Experiences at blithequark

Remote, USA Full-time

Experienced Loan Servicing Customer Service Representative – Real Estate Lending Expertise Required

Remote, USA Full-time

Direct Care Monitors

Remote, USA Full-time

American Express Virtual Assistant ( Work At Home )

Remote, USA Full-time

Transplant Liaison

Remote, USA Full-time

[Remote/WFM] Staff Fullstack Software Engineer, Search

Remote, USA Full-time

Distribution Specialist job at HUB International in Austin, TX

Remote, USA Full-time

Experienced Full Stack Staff Assistant I, Technical Operations – Remote Data Entry and Administrative Support

Remote, USA Full-time

Experienced Data Entry Clerk – Administrative Support for arenaflex Operations in Davie, FL

Remote, USA Full-time

Join Today: Travel Assistant (Remote)

Remote, USA Full-time