Apexon is a digital-first technology services firm specializing in accelerating business transformation and delivering human-centric digital experiences. We have been meeting customers wherever they are in the digital lifecycle and helping them outperform their competition through speed and innovation.Apexon brings together distinct core competencies in AI, analytics, app development, cloud, commerce, CX, data, DevOps, IoT, mobile, quality engineering and UX, and our deep expertise in BFSI, healthcare, and life sciences to help businesses capitalize on the unlimited opportunities digital offers. Our reputation is built on a comprehensive suite of engineering services, a dedication to solving clients toughest technology problems, and a commitment to continuous improvement. Backed by Goldman Sachs Asset Management and Everstone Capital, Apexon now has a global presence of 15 offices (and 10 delivery centers) across four continents.
We enable #HumanFirstDigital
Gen AI Consultant:
Key Responsibilities:
Design and implement GenAI-powered content summarization workflows using AWS services.
Utilize Amazon Bedrock, SageMaker, or custom LLMs on EC2/EKS for building and deploying summarization models.
Build and manage embedding pipelines using models like Amazon Titan Embeddings, OpenAI, or Hugging Face models.
Integrate and optimize vector databases (e.g., Pinecone, Amazon OpenSearch, Weaviate, FAISS, or Milvus) for efficient semantic retrieval.
Develop APIs and interfaces to access summarization outputs in real time.
Ensure system scalability, performance, and security following AWS best practices.
Collaborate with business and product stakeholders to understand use cases and refine output formats.
Stay updated with the latest in GenAI, prompt engineering, and retrieval-augmented generation (RAG) techniques.
Required Skills:
3 6+ years of experience in AI/ML, NLP, or data science.
Proven experience with AWS AI/ML services: Amazon Bedrock, SageMaker, Lambda, Step Functions, S3, API Gateway, IAM, etc.
Hands-on experience with LLMs (OpenAI, Anthropic, Cohere, etc.) and content summarization techniques.
Proficiency in Python, especially for AI workflows and AWS SDK (boto3).
Deep understanding of vector databases and semantic search.
Experience with RAG (Retrieval-Augmented Generation) architectures.
Familiarity with prompt engineering and LLM tuning strategies.
Strong problem-solving, communication, and stakeholder management skills.
Our Commitment to Diversity & Inclusion:
Our Perks and Benefits:
Our benefits and rewards program has been thoughtfully designed to recognize your skills and contributions, elevate your learning/upskilling experience and provide care and support for you and your loved ones. As an Apexon Associate, you get continuous skill-based development, opportunities for career advancement, and access to comprehensive health and well-being benefits and assistance.
We also offer:
o Group Health Insurance covering family of 4
o Term Insurance and Accident Insurance
o Paid Holidays & Earned Leaves
o Paid Parental LeaveoLearning & Career Development
o Employee Wellness
Job Location :
Ahmedabad, India