We’re looking for a hands-on full-stack engineer with deep backend expertise who can take cutting-edge AI platforms to the next level: pixel-perfect UIs, production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling.
Key Responsibilities
- Build core backend enhancements including APIs, security (OAuth2/JWT, rate-limiting, SecretManager), and observability (structured logging, tracing)
- Add CI/CD pipelines, implement test automation, configure health checks, and create SLO dashboards
- Develop polished user interfaces using React.js/Next.js, Redux/Context, Tailwind, MUI, custom CSS, shadcn, and Axios
- Design LLM and agentic services by creating micro/mini-services that host and route to OpenAI, Anthropic, local HF models, embeddings, and RAG pipelines
- Implement autonomous and recursive agents that orchestrate multi-step chains using tools, memory, and planning
- Spin up GPU/CPU inference servers behind an API gateway for model-inference infrastructure
- Optimize throughput with batching, streaming, quantization, and caching using Redis/pgvector
- Own the NLP stack by leveraging transformers for classification, extraction, and embedding generation
- Build data pipelines that integrate aggregated business metrics with model telemetry for analytics
- Mentor juniors to support learning and professional development
Tech You’ll Touch
- Full-stack/Backend, Infra
  - Python (or Node.js), FastAPI, Starlette, Pydantic
  - Async SQLAlchemy, Postgres, Alembic, pgvector
  - Docker, Kubernetes or ECS/Fargate, on AWS or GCP
  - Redis/RabbitMQ/Celery (jobs & caching)
  - Prometheus, Grafana, OpenTelemetry
  - For full-stack candidates: React.js, Next.js, shadcn, Tailwind CSS, MUI
- AI/NLP
  - HuggingFace Transformers, LangChain / LlamaIndex, PyTorch / TensorRT
  - OpenAI, Anthropic, Azure OpenAI, Cohere APIs
  - Vector search (Pinecone, Qdrant, pgvector)
- Tooling: Pytest and GitHub Actions; Terraform/CDK preferred
Why does this role matter?
- We are growing, and our projects and products each bring different challenges
- This role lets you learn and contribute in parallel, so that we succeed together
- If you are an engineer looking for the next leap in challenges to set your career on a rocket trajectory, this is the right role
- You will work in the Founder’s Office, mirror the founder’s way of working, and grow organically
- You will close these gaps while leading all future AI service development
Hiring Process
Short call → assignment → live coding/system design (1.5 hours) → team fit (30 minutes) → offer
About Company
C4Scale builds innovative products for startups, enterprises, and planet-scale companies, taking systems from zero to MVP and from MVP to scale. Our expertise spans B2B SaaS products, web apps, AI models, chatbots, AI-driven applications, and LLM-based solutions, developed for multiple global clients. Recognized as one of the “10 Most Promising SaaS Startups – 2023” by CIOTechOutlook magazine, we take pride in driving impactful solutions. Our founder previously led data at a leading ride-hailing super-app serving over 300 million consumers, enhancing mobility experiences across Southeast Asia. With 10+ AI and SaaS projects delivered across the USA, Ireland, Saudi Arabia, Indonesia, and India, C4Scale is building the future of intelligent products, leveraging deep learning, generative AI, cloud SaaS, and product engineering to deliver innovation at speed and scale.