Job Summary:
We are seeking a highly skilled and entrepreneurial Lead AI Applications Engineer to spearhead the development and deployment of cutting-edge applications built on generative AI models. This role focuses on translating state-of-the-art research into scalable, real-world products that drive measurable business impact.
Key Responsibilities:
- Lead the end-to-end development of AI-driven applications, from concept and prototyping to deployment in production environments
- Translate generative AI research into robust, user-centric product solutions
- Architect and implement solutions using AWS serverless infrastructure (Lambda, API Gateway, DynamoDB, etc.)
- Design and deploy AI Agents and Multi-Component Processing (MCP) servers at scale
- Collaborate cross-functionally with product, research, and engineering teams to bring early-stage ideas to market
- Optimize ML infrastructure and ensure high performance, scalability, and security in production systems
Required Qualifications:
- Proven experience in deploying AI/ML applications in production, especially AI Agents and MCP server architectures
- Strong command of Python and hands-on experience with cloud-native development on AWS
- Solid understanding of machine learning infrastructure, including model training, serving, and monitoring in real-world scenarios
- Demonstrated ability to own projects from zero to one — ideally with startup or early-stage product experience
- Familiarity with DevOps practices, CI/CD, and scalable microservices architecture