Loading...
Expertise
Menu
IT Infrastructure Reliability Engineering and Operations
Created 9 months ago in Technology / Other

I have over 24 years of experience in diverse areas of IT in Retails, Ecommerce, Fintech, Travel-Tech, Consumer goods, Online Services, Government, Oil and Gas, Telco’s, Banks etc. I have worked on over 2000 IT small, mid to a large-scale project. I have strong expertise in IT Infrastructure Reliability Engineering, with my innovative solutions, AI Powered Observability and Monitoring System, BOT’s and analytics, I constantly retained over 99% availability of all the IT services with superior customers experience, even during high peak time (during festivals) and saved millions of dollars of the companies. With strong const savings efforts I achieved most optimized cloud services. I have strong knowledge of Cyber Security and have successfully responded to thousands of security threats and vulnerabilities; I also introduced AI in Cyber security and automated several toiled processes. I headed IT Software and AI Products design and development and delivered more than 3000 AI powered BOT’s, Analytics dashboards, Observability and IT infrastructure Monitoring systems that saved million dollars of the companies.
I have helped all type of organizations including startups who are deeply struggling with the reliability of their IT infrastructure and services and lost huge revenue during peak business time or festival demand. My key areas of expertise are listed below but not limited to the following:
Digital Transformation and automation
24/7 Site Reliability Engineering and Operations.
24/7 Infrastructure NOC Operations Centre Design, setup and Management.
24/7 Monitoring of all the Network applications, servers, devices, Mobile Apps, Customers behavior, Cyber Security Threats, IOT Devices etc. and anomaly response.
24/7 DevOps and Tech Support Canter design.
Good hands-on expertise in both On-premise servers, Data Centers and on Cloud operations networking devices like Multilayer-switches, Routers, Wireless controllers, Teleconferencing systems Desktop hardware’s, software’s.
24/7 Cyber Security Operation Centre Design and Support with VAPT assessments.
Project Management (Agile and Scrum).
Infrastructure and Cyber Security Major Incidents and Escalation Management.
Disaster Recovery, Business Continuity planning, DR site setup, and Risk Management
ITIL functions like Incident, Problem, Change, including CMDB design and support.
IT Budgeting, Contracting, Customers, Vendors, Partners Relationship Management.
People & Resources Hiring, Training, Mentoring & Development and Retentions.
Business capacity planning and building.
Design of IT Service Support SOP’s, Technical Documentations, Guidelines, Customers Documentations, Corporate Security Policies, Users IT Assets usage policies etc.
Software Development and Software Project Management
AI Product Design, Development and Support.
CRM Integrations and Support.
IT Service Management Products Selection and Integrations.
Related Topics
Sachin Kumar
Delhi
Send Message
I am an IT leader with over 25 years of global experience driving digital transformation and service operations across leading multinational corporations in various sectors, including Travel Tech, Fintech, Telecom, FMCG, Hospitality, Banking and Finance, Hospitals and healthcare, eCommerce, Tech Support, and Government. I bring deep expertise in building and leading high-performing IT organizations, having successfully managed and mentored teams of over 300 engineers who support mission-critical systems for billions of users. Under my leadership, traditional IT divisions have been transformed into smart, automated, AI-powered operations, shifting from cost centers to strategic profit centers. I specialize in delivering scalable, resilient, and cost-effective IT solutions that align with business goals and drive measurable outcomes. ► Strategic Planning & Operational Support: Dedicated leader with a passion for results and a proclivity for aligning technology teams to achieve greater business objectives. Spearheaded the execution of strategic plans and service/operational goals to hit key performance metrics that support enterprise expansion. ► Operational Excellence & Improvement: A proven track record of successfully analyzing and solving critical business problems. Increase organizational effectiveness by continuously assessing, initiating, prioritizing, and driving technology solutions. ► Engineering Capabilities: I am a strong, result-oriented engineering leader and expert in designing and delivering cost-effective solutions to any business problem about their IT services. I have the expertise to optimize operations support, process automation, and the design of auto-remediation and self-healing systems to absorb an organization’s growing business and IT needs. Areas of expertise include but are not limited to the following: - 24/7 Site Reliability Engineering & Operations - 24/7 IT Services Management (Incidents, Problems, Change management) - DevOps - AIOps, MLOps - AI, Machine Learning & Data Science - Infrastructure NOC, - Corporate IT Services & Tech Support - Cyber Security - Digital Transformation & Smart Automation - Database engineering - Application Engineering & Support - Cloud Management (AWS, Azure, GCP, OpenStack, Oracle Cloud) - ERP Selection and Implementation - Data Center Operations - Software Design, Development & Support - AI Model Selection & Integrations (MCP, RAG, LLM’s) - Large Language Model (OpenAI, Anthropic, Mistral AI, MetaAI, Cohere) for AI Selection and Implementation - GenAI - IT Service Delivery and Service Support - Product Management - Compliance, Safety, and governance product selection and implementation - Disaster Recovery, Business Continuity planning, DR site setup, and Risk Management - IT Budgeting, Contracting, Customers, Vendors, Partners, Relationship Management - People & Resources Hiring, Training, and Retention. I have successfully delivered over 2,000 IT projects, including more than 500 AI-enabled and 50 LLM-integrated projects, across various internal and client-facing initiatives for HR, Sales, Marketing, IT, ITSM, SRE, and Corporate IT.
the startups.com platform
Copyright © 2025 Startups.com. All rights reserved.