Friday, March 6, 2026

Hiring for AiOps Engineer / Developer Technical Lead at San Francisco, CA

Hi,

Job Title: AIOps Engineer / Developer – Technical Lead
Location: San Francisco, CA
Duration: 12+ Months 


Job Summary:

We are seeking an experienced AIOps Engineer / Developer – Technical Lead to design, implement, and lead intelligent operations solutions that enhance system reliability, observability, and automation. The ideal candidate will combine deep technical expertise with leadership skills to drive AI-driven monitoring, incident management, predictive analytics, and automation initiatives across enterprise environments.


Key Responsibilities:

Technical Leadership

·       Lead the design and implementation of AIOps solutions across enterprise systems.

·       Provide technical guidance to engineering teams on architecture, tooling, and automation strategies.

·       Drive adoption of AI-driven monitoring and operational intelligence practices.

AIOps & Automation

·       Develop and deploy AI/ML-based monitoring and anomaly detection solutions.

·       Implement automation for incident detection, root-cause analysis, and remediation.

·       Integrate AIOps tools with existing monitoring and ITSM platforms.

·       Improve system observability across infrastructure, applications, and cloud platforms.

Platform & Integration

·       Work with logging, monitoring, and observability tools.

·       Integrate AIOps capabilities with cloud-native and hybrid environments.

·       Collaborate with DevOps, SRE, and Infrastructure teams to enhance reliability.

Performance & Reliability

·       Analyze operational data to identify patterns and reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR).

·       Drive continuous improvement initiatives across production systems.

Collaboration & Governance

·       Partner with cross-functional teams including Development, Operations, Security, and Business stakeholders.

·       Ensure alignment with enterprise architecture, compliance, and governance standards.


Required Skills & Qualifications:

·       Strong experience in AIOps, DevOps, or Site Reliability Engineering (SRE).

·       Experience leading technical initiatives in complex enterprise environments.

·       Hands-on experience with:

o   Monitoring and observability tools

o   Log analytics platforms

o   Incident management and automation frameworks

·       Experience with scripting or programming languages (e.g., Python, Shell, etc.).

·       Knowledge of cloud platforms (AWS, Azure, or GCP).

·       Strong analytical, problem-solving, and leadership skills.

·       Excellent communication and stakeholder management capabilities.


Nice to Have:

·       Experience implementing AI/ML models for operational intelligence.

·       Exposure to enterprise ITSM integrations.

·       Experience working in regulated or large-scale environments.

Thanks & Regards

Akshit Singh

M : 972-961-2517
https://www.linkedin.com/in/akshit-kumar-singh-5ba3aa138/

E : akshit.s@noviainfotech.com

--
You received this message because you are subscribed to the Google Groups "NoviaJobs" group.
To unsubscribe from this group and stop receiving emails from it, send an email to noviajobs+unsubscribe@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/noviajobs/CAC7KMKk%2Bgz1c4n%3DPE5dxiFD78x22uc-T5j3W%3DFeCGkrMPt%2BZ0g%40mail.gmail.com.

No comments:

Post a Comment

Sr. Network Operations Engineer for Roseville CA

Location: Roseville CA Duration: 6 Months   100% ONSITE ROLE   Sr. Network Operations Engineer   10+ years of ex...