Order to ITOps Chaos! Shailesh Manjrekar, Chief Marketing Officer with “The Data-Fabric for Observability, AIOps and GenAI” Powered by “Macaw GenAI Assistant”
Experience Dependent On Hybrid IT Hybrid IT Environment Challenges Siloed Tools / Tools Sprawl Inability to predict/prevent Noisy Data No business context Skillset Gap Customer Experience Claims Processing On-Prem SaaS Monitoring Tools
Challenges Challenges To Navigate Apps Edge Sites Public Clouds Data Centers Security Multi-Cloud IT Landscape Siloed Tools sprawl Digital Resilience Noisy Data Data Everywhere Skillset Gap Business Demands Real time Visibility Automated Operations Operational Efficiencies Security Resilience Customer Experience Accelerated AI assisted decision making
Data Fabric” – Unifying siloed domains Business Demand Siloed =>Unified & Data Driven The Data Fabric for GenAI - Operational Domain Convergence around Data & AI/ML AIOps Full Stack Observability Converged Data Platforms Intent-based Automation Real time Visibility Automated Operations Operational Efficiencies Security Resilience Customer Experience Accelerated AI assisted decision making Apps Edge Sites Public Clouds Data Centers Security Multi-Cloud IT Landscape
Observability Evolution Domain-centric to Full Stack Cross-Domain Insights – New KPI’s! • Alerts and Events Only • Static Dashboards • Passive, Sampling based • Root cause Identification • Domain-centric Tools Sprawl • Active and M.E.L based • Business Context and Impact • DevSecOps and SRE • Composable Dashboards KPI- Availability (domain - centric) KPI- Performance (domain - centric) KPI- Experience (Cross - domain) Key Missing Piece is “Observability Pipelines”
“Data Fabric for Observability, AIOps & GenAI” Drive Insights to Actions with all your Observability Data Macaw GenAI Assistant Customer Data Intent based Automation Data Fabric for GenAI Macaw GenAI Assistant Customer Data Intent based Automation Data Integration, Dynamic Data Ingestion AutoML Automation Data Enrichment, Contextualization Data Automation, Transformation Telemetry / Observability Pipelines Data Routing Conversational Queries Streaming Engine Analyze Datasets AI for Intent-Driven Automation
Queries! Low code -> Nocode Democratize Observability, AIOps and Automaton with Macaw! Create a composable dashboard for my syslog datasets Create a new RDA pipeline to create ServiceNow tickets Explain Network outage impact upstream Summarize all my critical Incidents and suggest probable remediations Validate device and put in maintenance mode
Composability across disparate data sources and dashboards improved productivity by 60% • Fault Management reduced noise and improved MTTI and MTTR by 40% • Performance Management improved customer experience Tier1 Telecom Provider Improves Productivity by 60% with Observability and AIOps • Improve Fault Management & visibility with near real time discovery and enrichment of alerts and events • Improve Performance Management with predictive analytics and AI/ML based regression analytics for anomaly detection and predictions • A Single pane of glass, persona-based composable dashboards which provided health and operational insights • Provide mission critical reliability, availability and scalability • Ingest and normalizie SNMP Traps, Syslog, GnMI and Bulkstats datatypes. Solution Details Client Background & Objectives Problem Details Client was able to implement CloudFabrix’s solution in production over a short 1 month duration and deliver value.. CloudFabrix Composable Dashboards and Composable pipelines powered by RDAF catered to the users’ needs and enabled ingestion of any data from any source. Client Value and Results “CloudFabrix enables SPOG to cater to different personas. Their RDA Bots and pipelines enabled choosing best of breed analytics and insights, maximizing the ROI and rapid development.to meet business needs. They enabled Co-development and joint decisions on the development of dashboards and analytics/insights” – Lead Telecom Architect “ ” Global Telcom giant with Complex multi-vendor 5G Edge and O-RAN, Campus and Datacenter, Optical and Mobility Business units was challenged with coalescing disparate data sources with legacy operational tools for effective Fault Management & Performance Management Operations.
Data-Fabric for Observability, AIOps and GenAI Edge Cloud RDA Edge Powered By Robotic Data Automation Fabric - RDAF 1000+ Bots Low-Code/ No-Code Composable Workflows Data Bots Distributed Data Fabric Macaw Generative AI Low Code to No Code Large Language Models Telemetry / Observability Pipelines AIOps Solutions Operational Awareness Organizational Awareness Business Awareness • Composable Dashboards • Composable Services • Observability Pipelines • Composable Bots • Visibility o X-domain Topology Discovery o Dependency Mapping • Insights o Correlation Reduction o Predictive Analytics • Automation o Root Cause Analysis o Remediation, ServiceOps • Generative AI • Incident Management • Analyze any Data-set DevSecOps/ ITOps BizOps Platform Eng.
Management using Conversational Queries! Ø Show incidents Ø +summarize above incidents Ø Show incidents for last weeks Ø +Summarize above incidents in HTML format. Ø Show incidents with i_summary contains avg latency Ø + provide recommendation on above incidents in HTML format Ø + provide recommendation where incident_id is CFX20231026902681eb08 Ø + provide probable root cause where incident_id is CFX20231026902681eb08
cause analysis using Conversational Queries! Generate charts and pipelines Ø generate RDA pipeline to query oia-alerts-stream and run clustering on message column with min cluster size of 200 Ø + at the end write the output to a stream 'clustered-alerts’ Ø generate multibar chart for oia-alerts-stream groupby a_severity and a_status Ø generate multibar chart for oia-alerts-stream groupby a_status and a_severity Regression analysis Ø show regression models Ø show anomalies for last week Ø show devices with increasing trend Ø show devices with increasing cpu usage Anomaly Detection Ø show high severity anomalies for device 10.95.133.93 Ø show regression chart for device 10.95.126.18 and cpu usage metric Ø show regression chart for device 10.95.105.64 and cpu usage metri Ø show devices belonging to group 0 for network usage metric
- Digital Eco-system Enabler runs 30% of traffic • Full Stack Discovery and Application and Impact Dependency mapping across Applications, VNF, CNF and Kubernetes o Change Management, Compliance, Management, Vulnerability Exposure, CMDB synchronization • Enrichment and Multi-layer Event Correlation - AI/ML, Stack-based, Attribute-based o Application and Infrastructure Correlation • Operations Productivity and TCO Reduction o RDAF based data source integrations, normalization and ingestion, capacity utilization, and consumption modeling • Future Proof Business with AI/ML and Continuous Learning o Predictive Analytics and Anomaly Detection, NLP and Sentiment Analysis, building Knowledge Corpus • Modern Incident Management with Virtual War Room o 97% Noise Reduction, 60% + MTTI, 50%+MTTR, Bi-directional integration with ServiceNow SP Service Assurance Managed Hosting Services Managed Security Services Core Stack App. Access EdgeDC Core & Cloud Hyperscaler Cloud CloudFabrix Solution VPN Software Stack CICD Application Stack Chosen over 5 competitors - Automated, Assured and Secured with CloudFabrix Data-centric AIOps Platform
and IBM Consulting – Helping Clients with Composable Analytics to unify Observability, AI, and Automation “I’m excited to share that IBM Consulting and CloudFabrix, the data-centric AIOps platform leader, are working together to help clients implement next-gen IT operations use cases like enterprise-wide observability, composable in-place search, and asset intelligence analytics by unifying data, AI and automation. We’re bringing together IBM Consulting's deep expertise in observability and AIOps solutions for global and deeply complex clients with CloudFabrix’s Composable Analytics powered by Robotic Data Automation Fabric” Meenakshi Srinivasan, Senior Partner, IBM Consulting
Observability Platform Networking Infrastructure Multicloud Security Application Unified Experiences Data from multiple operations domains Real-time Insights for Unified Experiences FSO Cisco Full-Stack Observability Bring data together from across application, networking, multicloud, infrastructure and security And drive business outcomes To unlock core and use cases
“Data Fabric for Observability” Drive Insights to Actions with all your Observability Data Macaw GenAI Assistant Customer Data Intent based Automation Data Fabric for Observability Data Automation & Ingestion Hybrid Data sources Domain-specific modules Hybrid Data Integration • Composability, • Contextualization and • Modernization Dynamic Data Automation & Ingestion Domain-specific modules
IT admin Single pane of glass Modules Persona Use case Campus Analytics NOC admin Campus N/W visibility Modules Persona Use case SAP Observability Basis admin SAP landscape visibility Modules Persona Use case Asset Intelligence Analytics IT planning Asset discovery/intelligence, App Dependency maps / CMDB update Modules Persona Use case Operational Intelligence Analytics IT Operations Event Correlation, ITSM Modules Persona Use case Infra Observability IT Operations Full Stack Observability
• Closed-loop automation, Root Cause Analysis • Prioritized service impact. Suppress extraneous alarms • Real-time performance metrics and machine-learning baselining • Anomaly and performance degradation detection • Optimize resources and workloads • Correlate services impact and isolate the underlying infrastructure problem • Closed-looped fast remediation •Automatically suppress extraneous alarms • Ability to ingest disparate data sources like SNMP Traps, Syslog, GnMI, Bulkstats • Automatically determine baselines and detect anomalies • ML-based analytics for deep actionable insights • Rapid development of dashboards and reports to visualize data collected from all required sources (e.g., devices and controllers) • Active-active solution • Ability to Ingest Millions of metrics • Multi-tenant, Microservices based architecture for on-premises, hybrid and Cloud deployments Fault Management Performance Management Automated Root-cause and Service Impact Composable Analytics Single Pane of Glass, Composable Dashboards Reliability, Availability and Scalability
Assets, Dependencies and their Lifecycle Data “ … saved us time & money… and greatly simplified the complexities of tracking Hardware/Software assets and their dependencies, ... we now have better control of managing lifecycle and upgrade initiatives … ” - Sr. Director, Cloud & IT Ops
Data Automation Fabric Data Fabric AI / ML Engine Asset Data Ops Data Biz Data AIOps IT Planning IT and Business Operations Asset Intelligence Service Real time visibility into Full stack interdependencies IT Change Refresh Impact Analysis Actionable Insights to adopt consumption based IT models Asset Intelligence Alert Noise Reduction Root Cause correlation Analysis Automated Incident Diagnostics & Remediation + AIOps service RDAF • 7 Data centers • 500K+ IT assets • 800 Facilities • 400 changes in Cy21 • 20+ member team • Business Problem – Limited Asset visibility across Healthcare IT, Always ON & Access anywhere application uptime, Contract Mgmt, Vendor Ticketing Automation • Solution – Asset Discovery, Application Dependency Mapping, Change Management, Utilization Reports, Root Cause Analysis and Incident Resolution • Benefits – 100% Asset visibility, 40% Reduction in OPEX, Per App Impact assessment from 240 Hrs -> 15 mins