Applying Elasticsearch to Intelligent IT Operations
medcl
January 20, 2018
Technology
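A talk on the technical principles and practical use of Elasticsearch and the X-Pack components in intelligent IT operations, including how unsupervised machine learning is applied to automatic anomaly detection, advanced correlation and categorization, root cause diagnosis, and early failure prediction.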
Transcript
Applying Elasticsearch to Intelligent IT Operations. Zeng Yong (曾勇), Technical Expert, Elastic
What is intelligent operations?
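Artificial intelligence!
Putting it into practice!
Let's look at the concrete pain points of operations!
Servers, hardware, networks …
Software, services, code …
Sensors, devices, IoT … Image Credit: https://www.flickr.com/photos/teco_kit/23908928999
Every moment, these generate massive volumes of …
Events, logs …
Metrics, indicators …
What we expect to get out of them …
Reports …
Anomalies …
Alerts …
Because …
Improve: Uptime, Stability, Visibility. Reduce: Errors, Downtime, Time to Resolution.
You need …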
All of these: Unstructured, Machine Learning, Query language, Fast, Highly available, Secure, Enrichment, Advanced Analytics, Dashboards, Scalable, Alerting, SaaS, Log correlation, APIs, Visualizations, Real-time, Drill down, Reports, Data sources
Elastic provides all of these: Unstructured, Machine Learning, Query language, Fast, Highly available, Secure, Enrichment, Advanced Analytics, Dashboards, Scalable, Alerting, SaaS, Log correlation, APIs, Visualizations, Real-time, Drill down, Reports, Data sources
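Why is Elastic different?
Users are everywhere
LOG MANAGEMENT MOBILE APM SYSTEM MONITORING TIME SERIES WEB MONITORING ANOMALY DETECTION
Although Elastic is not in the <Gartner Magic Quadrant>
Search Analytics Numbers Text Logs Historical Metrics Real time Heuristic Machine Learning
However, diversity is our real strength
Back to the topic
Monitoring in operations! • Collecting monitoring metrics • Storing monitoring data • Analyzing monitoring data • Alerting on monitoring data
Monitoring in operations! • Storing monitoring data • Analyzing monitoring data • Alerting on monitoring data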
Metricbeat, Filebeat, Auditbeat & Logstash
• System: Linux, MacOS, Windows, Perfmon
• Custom apps: JMX/Jolokia, PHP-FPM, Golang, Dropwizard
• Storage: Ceph
• Cloud: AWS, GCP, DigitalOcean
• Queues: Redis, Kafka, RabbitMQ
• Security: ArcSight
• Caches: Memcached
• Containers: Docker, Kubernetes
• Virtualization: vSphere
• Datastores: MySQL, PostgreSQL, MongoDB, Couchbase, Aerospike
• Network: Netflow, Packets
• Web servers: Apache, Nginx
• Other: HAProxy, Zookeeper, Prometheus, Graphite, Icinga … …
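Monitoring in operations! • Analyzing monitoring data • Alerting on monitoring data
Monitoring in operations! • Analyzing monitoring data
Monitoring in operations! Artificial intelligence, or nothing but manual labor?
Manageable, it's only a few thousand metrics! CPU Metrics
Manageable, it's only a few thousand metrics! Tens of thousands? CPU Metrics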
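Elastic Stack can collect metrics at massive scale • Explosion!
ELASTIC's artificial intelligence: intelligent operations.
ELASTIC's artificial intelligence: intelligent operations. ELASTIC's machine learning.
First, look at the monitoring data • Three categories – Logging – Tracing info – Metrics
All of it is time series data!
What is time series data?
Why use time series data?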
Bucketing
Choosing the bucket
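The bucketing idea on these slides can be illustrated with a minimal Python sketch. It is an illustration only, not how Elasticsearch implements bucketing: the function name `bucket_mean`, the sample CPU readings, and the two bucket widths are all assumptions made up for this example. It groups timestamped readings into fixed-width buckets and averages each one, which shows why the choice of bucket width matters.

```python
from collections import defaultdict
from statistics import mean


def bucket_mean(points, bucket_span):
    """Group (timestamp_seconds, value) points into fixed-width buckets
    and return the mean value per bucket, keyed by bucket start time."""
    buckets = defaultdict(list)
    for ts, value in points:
        bucket_start = ts - (ts % bucket_span)  # align to the bucket boundary
        buckets[bucket_start].append(value)
    return {start: mean(values) for start, values in sorted(buckets.items())}


# Hypothetical CPU readings as (epoch seconds, percent busy).
cpu_points = [(0, 10.0), (20, 20.0), (65, 30.0), (110, 50.0), (130, 70.0)]

per_minute = bucket_mean(cpu_points, 60)        # narrow buckets keep more detail
per_two_minutes = bucket_mean(cpu_points, 120)  # wide buckets smooth the signal

print(per_minute)
print(per_two_minutes)
```

A narrower bucket reacts faster to short spikes but is noisier, while a wider bucket smooths the series and can hide brief anomalies.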
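Supervised machine learning.
Unsupervised machine learning.
Time series metrics: turn them into features, turn them into models!
Let the machine monitor the massive volume of metrics for you and find the anomalies!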
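As a rough illustration of letting a machine watch a metric without labelled training data, here is a minimal Python sketch. It is a deliberately simple stand-in (a rolling mean and standard deviation with a z-score threshold), not the modelling used by Elastic's machine learning; the function name `find_anomalies`, the window size, the threshold, and the sample values are all assumptions chosen for the example.

```python
from statistics import mean, stdev


def find_anomalies(values, window=5, threshold=3.0):
    """Return the indices of values that deviate from the mean of the
    preceding `window` values by more than `threshold` standard deviations."""
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # Flat history: treat any change at all as a deviation.
            if values[i] != mu:
                anomalies.append(i)
            continue
        if abs(values[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical per-bucket CPU averages with one obvious spike at index 7.
bucketed_cpu = [20.0, 21.0, 19.0, 20.0, 22.0, 21.0, 20.0, 95.0, 21.0, 20.0]
anomalous = find_anomalies(bucketed_cpu)
print(anomalous)
```

The baseline here is learned only from the metric's own recent history, which is the sense in which the approach is unsupervised. The same loop can run over thousands of series without anyone hand-tuning a static threshold per metric.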
DEMO
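Summary • Operations has entered an era of fine-grained, intelligent management • AI will not put operations staff out of work • Let machines do what machines are good at • Elastic makes operations analysis simpler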
IT-OPS-KPI
IT-OPS-NETWORK
IT-OPS-SQL
Correlation analysis