AI Engineer
Full Time
9 Jun 2023
Verified by Turrior
Content + Source + Freshness • 13 Dec 2025 • 95% confidence
80 / 100
Offer value
Moderate value addressing a broad range of engineering needs in HPC, balanced with opportunities for technological influence.
- Contribute to high-priority HPC and AI projects
- Engage with complex, large-scale system challenges
- Diverse projects providing skill enhancement
- Requires a firm foundation in AI and HPC technologies
Pros
- Wide variety of projects involving HPC and AI
- Potential to influence high-level architectural decisions
- Opportunity to grow technical expertise in a significant company
Cons
- Possibly ambiguous role and unclear growth path
- Intense workload typical in high-performance computing environments
- Limited flexibility in working hours
Who it's for
Mid Level • Office-based (Hyderabad)
Good fit
- Experienced engineers in AI and HPC
- Candidates eager for technical challenges
- Professionals wanting to influence architecture and design
Not recommended for
- New entrants to AI and HPC fields
- Those preferring straightforward, routine tasks
- Candidates needing significant work-life flexibility
Motivation fit
- Desire to work with cutting-edge AI technologies
- Interest in bridging application and infrastructure design
- Eagerness to engage in critical problem-solving
Key skills
- Linux systems management
- Programming skills (Python, Bash, Go)
- Experience with CI/CD and container orchestration
- Networking and storage expertise
About the job
Job Description:-
- Provides support to a portfolio of technical solutions within a delivery channel, focusing on HPC, AI, and ML systems and software tools.
- Works with application, data, and infrastructure teams to produce optimal, high-level conceptual designs for projects. Supports enterprise-level solutions that integrate across applications, systems, and platforms.
- Manages changes in process, policy, and standards as they relate to the architecture and design principles.
- Researches and maintains knowledge in emerging technologies and solutions to solve business problems.
- Serves as a technical expert and critical resource across multiple disciplines.
Roles and Responsibilities:-
- Collaborates with internal stakeholders to understand future NVIDIA deployments, supporting project exigencies and improving DGX POD efficiency on a Kubernetes-based platform.
- Reviews the architecture of applications and supports technical design sessions with architects and developers, including the creation of class models, sequence diagrams, component models, and design specifications.
- Creates project and application architecture deliverables that are consistent with architecture principles, standards, methodologies, and best practices. Researches and maintains knowledge in emerging technologies and possible application to the business. Designs and develops new tools to support Software Development Lifecycle (SDLC) processes.
- Serves as a liaison with the engineering team around required features, critical bugs, and testing of new functionality. Communicates implications of architectural decisions, issues and plans to business and IT Leadership. Provides input to the development of project initiation documents including objectives, scope, approach, and deliverables, when needed.
- Partners with ITS business representatives and business leaders to understand business drivers and critical needs. Ensures alignment between the business strategies and application technology roadmap while advising and consulting leadership on costs, benefits, and implementation requirements.
- Supports team initiatives across functions with application triage, performance engineering, and testing activities. Assists in the troubleshooting and triage of complex application issues. Provides support and guidance to development teams throughout the analysis, design, development, and testing processes. Resolves complex technical issues as needed to support solution development.
Requirements:-
- Bachelor's degree in Computer Science (CS), Computer Engineering (CSEE), or a related STEM field, and/or equivalent professional experience.
- Strong experience supporting Linux, OS installation and automation (PXE, Kickstart, Ansible), networking, and storage.
- Strong experience with TCP/IP networking fundamentals: ports, IP subnets, DNS, and routes.
- Expert programming/scripting skills in Linux Shell/CLI, Bash, Python, and Go.
- Strong understanding of CI/CD processes and deployment tools, including ArgoCD, Kubernetes, Helm, and Docker.
- Experience with resource management systems and job scheduling, including running and debugging parallel programs.
- Strong experience using Git and other version control systems.
- Experience supporting large-scale data management systems serving hundreds of users/data scientists.
- Experience with provisioning and configuration management tools such as Puppet, Ansible, Chef, and Terraform.
- Excellent critical thinking, verbal communication, and problem-solving skills.
Preferred Qualifications:-
- BS/MS in Computer Science (CS), Computer Engineering (CSEE), Electrical Engineering (EE), or a related STEM degree, with three or more years of experience supporting HPC and AI-focused technologies.
- Familiarity with NVIDIA GPUs on Linux, HPC (High Performance Computing), InfiniBand, MPI, and RDMA technologies.
- Experience supporting AI and data science projects and software tools in an HPC environment.
- Experience supporting modern deep learning software architectures and frameworks, including TensorFlow, PyTorch, or other frameworks.
- Familiarity with supporting different cloud providers.
- Strong expertise with Agile Methodology and supporting tools.
- Ability to effectively communicate and engage with AI engineering and data science teams.

