We’re looking for a hands-on full stack who is a backend expert who can build cutting-edge AI platforms to the next level: pixel-perfect UIs, production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling. Key Responsibilities Build core backend enhancements including APIs, security (OAuth2/JWT, rate-limiting, SecretManager), and observability (structured logging, tracing) Add CI/CD pipelines, implement test automation, configure health checks, and create SLO dashboards Develop awesome UI interfaces using React.js/Next.js, Redux/Context, Tailwind, MUI, Custom-CSS, Shadcn, and Axios Design LLM and agentic services by creating micro/mini-services that host and route to OpenAI, Anthropic, local HF models, embeddings, and RAG pipelines Implement autonomous and recursive agents that orchestrate multi-step chains using tools, memory, and planning Spin up GPU/CPU inference servers behind an API gateway for model-inference infrastructure Optimize throughput with batching, streaming, quantization, and caching using Redis/pgvector Own the NLP stack by leveraging transformers for classification, extraction, and embedding generation Build data pipelines that integrate aggregated business metrics with model telemetry for analytics Mentor juniors to support learning and professional development Tech You’ll Touch Fullstack/Backend, Infra Python(or NodeJs), FastAPI, Starlette, Pydantic Async SQLAlchemy, Postgres, Alembic, pgvector Docker, Kubernetes or ECS/Fargate - AWS (Or) GCP Redis/RabbitMQ/Celery (jobs & caching) Prometheus, Grafana, OpenTelemetry If you are a fullstack, then - react.js/next.js/shadcn/tailwind.css/MUI AI/NLP HuggingFace Transformers, LangChain / Llama-Index, Torch / TensorRT OpenAI, Anthropic, Azure OpenAI, Cohere APIs Vector search (Pinecone, Qdrant, PGVector) Tooling- Pytest, GitHub Actions and Terraform/CDK preferred Why does this role matter? We are growing. We have projects & products that have different challenges This role will expand into learning & contribution in parallel to succeed together If you are an engineer who’s looking for the next leap in challenges to set your career on a rocket trajectory - this is an apt role You will work in the Founder’s office, replicate the founder & grow organically You will close these gaps while leading all future AI service development Hiring Process Short call → assignment → live coding/system design 1.5 hour → team fit: 30 minutes →offer About Company: C4Scale builds innovative products for startups, enterprises, and planet-scale companies, taking systems from zero to MVP and from MVP to scale. Our expertise spans B2B SaaS products, WebApps, AI models, chatbots, AI-driven applications, and LLM-based solutions, developed for multiple global clients. Recognized as one of the “10 Most Promising SaaS Startups – 2023” by CIOTechOutlook magazine, we take pride in driving impactful solutions. Our founder previously led data at a leading ride-hailing super-app serving over 300 million consumers, enhancing mobility experiences across Southeast Asia. With 10+ AI and SaaS projects delivered across the USA, Ireland, Saudi Arabia, Indonesia, and India, C4Scale is building the future of intelligent products- leveraging deep learning, generative AI, cloud SaaS, and product engineering to deliver innovation at speed and scale.
About the job: We're looking for a hands-on full stack who is a backend expert who can build cutting-edge AI platforms to the next level: pixel-perfect UIs, production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling. Key Responsibilities: 1. Build core backend enhancements including APIs, security (OAuth2/JWT, rate-limiting, SecretManager), and observability (structured logging, tracing) 2. Add CI/CD pipelines, implement test automation, configure health checks, and create SLO dashboards 3. Develop awesome UI interfaces using React.js/Next.js, Redux/Context, Tailwind, MUI, Custom-CSS, Shadcn, and Axios 4. Design LLM and agentic services by creating micro/mini-services that host and route to OpenAI, Anthropic, local HF models, embeddings, and RAG pipelines 5. Implement autonomous and recursive agents that orchestrate multi-step chains using tools, memory, and planning 6. Spin up GPU/CPU inference servers behind an API gateway for model-inference infrastructure 7. Optimize throughput with batching, streaming, quantization, and caching using Redis/pgvector 8. Own the NLP stack by leveraging transformers for classification, extraction, and embedding generation 9. Build data pipelines that integrate aggregated business metrics with model telemetry for analytics 10. Mentor juniors to support learning and professional development Tech you'll touch: 1. Fullstack/Backend, Infra 2. Python(or NodeJs), FastAPI, Starlette, Pydantic 3. Async SQLAlchemy, Postgres, Alembic, pgvector 4. Docker, Kubernetes or ECS/Fargate - AWS (Or) GCP 5. Redis/RabbitMQ/Celery (jobs & caching) 6. Prometheus, Grafana, OpenTelemetry 7. If you are a fullstack, then - react.js/next.js/shadcn/tailwind.css/MUI 8. AI/NLP 9. HuggingFace Transformers, LangChain / Llama-Index, Torch / TensorRT 10. OpenAI, Anthropic, Azure OpenAI, Cohere APIs 11. Vector search (Pinecone, Qdrant, PGVector) 12. Tooling- Pytest, GitHub Actions and Terraform/CDK preferred Why does this role matter? 1. We are growing. We have projects & products that have different challenges 2. This role will expand into learning & contribution in parallel to succeed together 3. If you are an engineer who's looking for the next leap in challenges to set your career on a rocket trajectory - this is an apt role 4. You will work in the Founder's office, replicate the founder & grow organically 5. You will close these gaps while leading all future AI service development Hiring process: Short call assignment live coding/system design 1.5 hour team fit: 30 minutes offer Who can apply: Only those candidates can apply who: have minimum 1 years of experience Salary: ₹ 3,00,000 - 6,00,000 /year Experience: 1 year(s) Deadline: 2025-10-11 23:59:59 Skills required: React, Tailwind CSS, Next.js, Axios and Shadcn Other Requirements: A. Must-have experience: 1. 1+ yrs building production Python(or Nodejs) REST APIs (FastAPI, Flask 2. or Django-REST) 3. SQL schema design & query optimization in Postgres (CTEs, JSONB) 4. Deep knowledge of async patterns & concurrency (asyncio, AnyIO, celery) 5. Crafted awesome UI Applications that integrate with the backend API 6. Hands-on with LLM/embedding workflows, prompt-engineering, and at least one of “agent-ops” 7. frameworks (LangGraph, CrewAI, AutoGen) 8. Cloud container orchestration (Any of K8s, ECS, GKE, AKS, etc.) 9. CI/CD pipelines and infra-as-code B. Nice-to-have: 1. Streaming protocols (Server-Sent Events, WebSockets, gRPC) 2. NGINX Ingress/AWS API Gateway 3. RBAC / multi-tenant SaaS security hardening 4. Data privacy, PII redaction, secure key vault integrations 5. Bitemporal or event-sourced data models About Company: C4Scale builds innovative products for startups, enterprises, and planet-scale companies, taking systems from zero to MVP and from MVP to scale. Our expertise spans B2B SaaS products, WebApps, AI models, chatbots, AI-driven applications, and LLM-based solutions, developed for multiple global clients. Recognized as one of the '10 Most Promising SaaS Startups - 2023' by CIOTechOutlook magazine, we take pride in driving impactful solutions. Our founder previously led data at a leading ride-hailing super-app serving over 300 million consumers, enhancing mobility experiences across Southeast Asia. With 10+ AI and SaaS projects delivered across the USA, Ireland, Saudi Arabia, Indonesia, and India, C4Scale is building the future of intelligent products- leveraging deep learning, generative AI, cloud SaaS, and product engineering to deliver innovation at speed and scale.
As a Full Stack Engineer at C4Scale, you will be responsible for building cutting-edge AI platforms to the next level, including pixel-perfect UIs, production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling. **Key Responsibilities:** - Build core backend enhancements such as APIs, security (OAuth2/JWT, rate-limiting, SecretManager), and observability (structured logging, tracing). - Add CI/CD pipelines, implement test automation, configure health checks, and create SLO dashboards. - Develop UI interfaces using technologies like React.js/Next.js, Redux/Context, Tailwind, MUI, Custom-CSS, Shadcn, and Axios. - Design LLM and agentic services by creating micro/mini-services hosting and routing to OpenAI, Anthropic, local HF models, embeddings, and RAG pipelines. - Implement autonomous and recursive agents that orchestrate multi-step chains using tools, memory, and planning. - Spin up GPU/CPU inference servers behind an API gateway for model-inference infrastructure. - Optimize throughput with techniques like batching, streaming, quantization, and caching using Redis/pgvector. - Own the NLP stack by leveraging transformers for classification, extraction, and embedding generation. - Build data pipelines that integrate aggregated business metrics with model telemetry for analytics. - Mentor juniors to support learning and professional development. **Tech You'll Touch:** - Fullstack/Backend, Infra: Python (or NodeJs), FastAPI, Starlette, Pydantic. - Async SQLAlchemy, Postgres, Alembic, pgvector. - Docker, Kubernetes or ECS/Fargate - AWS (Or) GCP. - Redis/RabbitMQ/Celery (jobs & caching). - Prometheus, Grafana, OpenTelemetry. - AI/NLP: HuggingFace Transformers, LangChain / Llama-Index, Torch / TensorRT. - OpenAI, Anthropic, Azure OpenAI, Cohere APIs. - Vector search (Pinecone, Qdrant, PGVector). - Tooling: Pytest, GitHub Actions, and Terraform/CDK preferred. **Why does this role matter ** We are growing with diverse projects & products, offering different challenges. This role will provide opportunities for learning & contribution in parallel to ensure mutual success. If you are an engineer seeking a challenging role to propel your career forward, this is the right opportunity for you. You will work in the Founder's office, replicate the founder's vision, and grow organically while leading all future AI service development. In the hiring process, you can expect a short call, assignment, live coding/system design session of 1.5 hours, team fit interview lasting 30 minutes, and finally an offer. (Note: The additional details about the company have been omitted as they were not relevant to the job description.),
Were looking for a hands-on fullstack who is a backend expert who can build cutting edge AI platforms to the next level: pixel-perfect UIs, production-grade model-inference services, agentic AI workflows, and seamless integration with third-party LLMs and NLP tooling.Role & responsibilities. What you'll build • Core Backend Enhancements - Build APIs and awesome UI Interfaces - React.js/Next.js, Redact/Context, Tailwind / MUI / Custom-CSS / Shadcn / Axios • LLM & Agentic Services Design micro/mini-services that host and route to OpenAI, Anthropic, local HF models, embeddings & RAG pipelines Implement autonomous/recursive agents that orchestrate multi-step chains (Tools, Memory, Planning) • Model-Inference Infrastructure – Spin up GPU / CPU inference servers behind an API gateway – Optimize throughput with batching, streaming, quantization & caching (Redis / pgvector) • Mentor Juniors Tech you’ll touch Fullstack/Backend | Infra • Python(or NodeJs), FastAPI, Starlette, Pydantic - If you are a fullstack then - react.js/next.js/shadcn/tailwind.css/MUI • Async SQLAlchemy, Postgres, Alembic, pgvector • Redis / Celery (jobs & caching) - Docker, Kubernetes or ECS/Fargate - AWS (Or) GCP • HuggingFace Transformers, LangChain / Llama-Index, Torch / TensorRT • OpenAI, Anthropic, Azure OpenAI, Cohere APIs • Vector search (Pinecone, Qdrant, PGVector) Tooling • Pytest, GitHub Actions Must-have experience • 2+ to 6+ yrs building production Python(or Nodejs) REST APIs (FastAPI, Flask or Django-REST) • Crafted awesome UI Applications that integrates with backend API • SQL schema design & query optimization in Postgres (CTEs, JSONB) • Deep knowledge of async patterns & concurrency (asyncio, AnyIO, celery) • Hands-on with LLM/embedding workflows, prompt-engineering, and at least one of “agent-ops” frameworks (LangGraph, CrewAI, AutoGen) • Cloud container orchestration (Any of K8s, ECS, GKE, AKS, etc.) • CI/CD pipelines and infra-as-code Nice-to-have • Voice based AI & Knowledge Base Search Systems applications expertise Why does this role matter ? We are growing. We have products that have different challenges This role will expand into learning & contribution parallelly to succeed together You will close these gaps while leading all future AI service development. If you are an engineer who’s looking for the next leap in challenges to set your career in rocket trajectory - this is an apt role You will work in the Founder’s office, replicate the founder & grow organically Hiring process ShortCall Assignment, Live coding/System Design 1.5 hour, Team fit: 30 minutes, Offer
FIND ON MAP