y. Infosys is unable to provide immigration sponsorship for this role at this time.
- Bachelor's degree in Computer Science, AI/ML, or related field.
- 5 years of experience in software engineering or data science, with 2-3 years in Gen AI or LLM-based systems.
- Strong Python programming skills and experience with ML/AI libraries (Hugging Face Transformers, LangChain, PyTorch).
- Hands-on experience with vector databases (FAISS, Pinecone, Weaviate, Azure AI Search).
- Familiarity with cloud platforms and Gen AI services (AWS, Azure, GCP).
- Experience with REST API development (FastAPI, Flask) and containerization (Docker).
- Solid understanding of AI governance, model safety, and prompt engineering.
Key Responsibilities:
- Design, develop, and deploy Gen AI applications using LLMs and agentic frameworks (e.g., LangGraph, AutoGen, CrewAI).
- Fine-tune open-source and proprietary LLMs using techniques like LoRA, QLoRA, and PEFT.
- Build and optimize RAG pipelines with hybrid retrieval, semantic chunking, and vector search.
- Integrate Gen AI solutions with cloud-native services (AWS Bedrock, Azure OpenAI, GCP Vertex AI).
- Work with unstructured data (PDFs, HTML, audio, images) and multimodal models.
- Implement LLMOps practices including prompt versioning, caching, observability, and cost tracking.
- Evaluate model performance using tools like RAGAS, DeepEval, and FMEval.
- Collaborate with product managers, data engineers, and UX teams to deliver production-ready solutions.
- Mentor junior engineers and contribute to code reviews, design discussions, and best practices.
Preferred Qualifications:
- Exposure to agentic workflows and autonomous agents.
- Experience with CI/CD pipelines and DevOps tools (GitHub Actions, Jenkins, Terraform).
- Familiarity with front-end integration (React, Angular, TypeScript) and GraphQL APIs.
- Knowledge of model interpretability, bias mitigation, and human-in-the-loop systems.
- Experience with multimodal models and perception systems (e.g., vision + language).
The job entails sitting and working at a computer for extended periods of time. Candidates should be able to communicate by telephone, email, or face-to-face.
The estimated annual compensation range for a candidate based in the location below is:
Ontario: $92,740 to $123,375