Job Description – Senior AI/Cloud Engineer (Full Stack / Microservices)
Core Responsibilities
1. Microservices Architecture
Architect, design, and develop microservices-based applications to ensure scalability and maintainability.
2. Containerization
Utilize containerization technologies to package and deploy application components consistently across environments.
3. Orchestration & Deployment
Implement and manage container orchestration for automated deployment, scaling, and operation of services.
4. DevOps Implementation
Collaborate with cross-functional teams to implement DevOps practices including CI/CD pipelines, Infrastructure as Code (IaC), and automated testing.
5. Scalability Optimization
Architect horizontally scalable systems to handle dynamic workloads efficiently.
6. Resilience & Reliability
Design fault-tolerant systems ensuring high availability and rapid recovery from failures.
7. Monitoring & Logging
Implement robust monitoring and logging using Prometheus and Grafana.
8. API-First Design
Champion API-first development with well-defined, documented APIs for seamless communication between services.
9. Leadership & Mentorship
Mentor developers, conduct code/design reviews, and promote engineering best practices.
10. AI/ML Development
Build AI/ML solutions such as:
- Document understanding
- Summarization systems
- Chatbots
11. AI Agents & Workflows
Develop AI agents and workflows integrated with internal systems.
12. Model Evaluation
Test and evaluate AI models including LLMs and computer vision systems.
13. MLOps Exposure
Work with MLOps practices including:
- Model versioning
- Deployment automation
- Performance monitoring
14. Agile Development
Apply Agile methodologies for iterative and high-quality software delivery.
15. Documentation & Standards
Maintain clear documentation and follow coding standards and best practices.
Languages & Frameworks
- Programming: Python, C, Rust, Java
- AI/ML: TensorFlow, PyTorch
- Frontend: React.js, Angular.js, Vue.js
Infrastructure & Tools
- Kubernetes, Docker
- Kafka / RabbitMQ
- Prometheus, Grafana
- SQL / NoSQL Databases
- CI/CD Pipelines
- Infrastructure as Code (IaC)
- Service Mesh, API Gateway
Requirements
- Bachelor’s degree in Computer Science or related field
(or 5+ years of relevant experience in lieu of degree) - 10+ years of experience in designing and deploying cloud-native applications
- Strong proficiency in Python, with hands-on experience in TensorFlow and PyTorch
- Expertise in multiple programming languages (C, Rust, Java, Python)
- Strong backend development skills with SQL and NoSQL databases
- Experience with modern JavaScript frameworks (React, Angular, or Vue)
- Deep understanding of:
- SOLID principles
- 12-Factor app methodology
- Distributed messaging systems
- API gateways & service mesh
- Serverless architecture
- Experience with event-driven architecture using Kafka or RabbitMQ
Best Regards,
Rajdeep Singh
Technical Recruiter | Sumeru Inc.
ISO 27001 Certified
📍 2401 15th Street NW, Washington, DC – 20009
📧 rajdeep.singh@sumerusolutions.com
📞 +1 404-963-8772