Anomaly Detection on Remote Sensing with Ray + Horovod (Linsong Chu, IBM Research)

Anomaly Detection on Remote Sensing Data with Ray+Horovod
Linsong Chu, IBM Research

Background
• NASA collects an average of 10M spatiotemporally referenced images a day.
• IBM Research worked with NASA to develop a solution for ranking images by their perplexity (e.g., a high level of spatial dissimilarity with the surroundings, or a high level of temporal dissimilarity with historical observations).
• Highly ranked images can indicate an interesting event, which an analyst may then inspect.

A Deadly Debris Flow in India. The image pair above shows a close-up of the same area before and after the debris flow, on January 20 and February 21, 2021.
https://earthobservatory.nasa.gov/images/147973/a-deadly-debris-flow-in-india

Approach
• Predict the image at time T
• Images from timestamps T-K to T-1 are used as input
• A U-Net architecture encodes and decodes the input to reconstruct and predict the image at time T
• Compare the prediction with the ground truth at time T
• Multiple metrics can be used: MSE, deviation, etc.
• The difference is used as a proxy for the rank
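
The approach slide can be illustrated with a small sketch. It assumes NumPy and a synthetic (T, H, W) image series, and it replaces the U-Net with a trivial stand-in predictor (the mean of the K input frames) so the example runs without a trained model. The function names `make_windows`, `predict_stand_in`, and `rank_by_mse` are hypothetical and not taken from the talk. The sketch builds (T-K .. T-1) input windows, scores each target image by MSE against its prediction, and ranks the images by that score.

```python
import numpy as np


def make_windows(series: np.ndarray, k: int):
    """Split a (T, H, W) image series into (inputs, targets).

    inputs[i] holds frames i .. i+k-1 and targets[i] is frame i+k,
    i.e. the K previous frames are used to predict the next one.
    """
    if series.ndim != 3 or not 0 < k < series.shape[0]:
        raise ValueError("expected a (T, H, W) array with 0 < k < T")
    n = series.shape[0] - k
    inputs = np.stack([series[i:i + k] for i in range(n)])
    targets = series[k:]
    return inputs, targets


def predict_stand_in(inputs: np.ndarray) -> np.ndarray:
    """Stand-in for the trained U-Net (assumption): mean of the K input frames."""
    return inputs.mean(axis=1)


def rank_by_mse(predictions: np.ndarray, targets: np.ndarray):
    """Return (order, scores): indices from most to least anomalous, and per-image MSE."""
    scores = ((predictions - targets) ** 2).mean(axis=(1, 2))
    order = np.argsort(-scores)  # largest prediction error first
    return order, scores


# Synthetic series: six constant 4x4 frames with an abrupt change in frame 4.
series = np.full((6, 4, 4), 0.5)
series[4, 1:3, 1:3] = 0.0

inputs, targets = make_windows(series, k=3)
order, scores = rank_by_mse(predict_stand_in(inputs), targets)
print("target frames ranked by anomaly score:", order + 3)
```

In this toy series the frame containing the abrupt change receives the largest error and ranks first. The frame after it also scores above zero, because the changed frame is part of its input window.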

Challenges
• High Volume
  • The volume of data is significant: 10M raw images lead to billions of inputs
  • Distributed training is necessary
• Volatile Volume
  • For a specific region of interest, the daily volume can vary widely
  • Serverless training is preferred
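
A minimal sketch of how these two challenges might be addressed with Horovod on Ray follows. It is an illustration under stated assumptions, not the configuration used in the talk. `workers_for_volume` is a hypothetical sizing rule (the deck does not say how workers are provisioned), `train_fn` omits the model and data loading, and the launch step assumes `ray` and a `horovod` build with PyTorch and Ray support are installed. The sketch picks a worker count from the day's sample volume, then starts that many Horovod workers as Ray actors.

```python
import math


def workers_for_volume(num_samples: int, samples_per_worker: int, max_workers: int) -> int:
    """Hypothetical sizing rule: one worker per `samples_per_worker` samples,
    clamped to the range [1, max_workers]."""
    if samples_per_worker <= 0 or max_workers <= 0:
        raise ValueError("samples_per_worker and max_workers must be positive")
    needed = math.ceil(num_samples / samples_per_worker)
    return max(1, min(max_workers, needed))


def train_fn():
    """Runs on every Horovod worker; the real model and data loading are omitted."""
    import horovod.torch as hvd  # lazy import keeps the sizing helper dependency-free

    hvd.init()
    # A real job would build the model here, wrap its optimizer in
    # hvd.DistributedOptimizer, and read this worker's shard of the data.
    return hvd.rank(), hvd.size()


def launch(num_workers: int, use_gpu: bool = False):
    """Start `num_workers` Horovod workers as Ray actors and run train_fn on each."""
    import ray
    from horovod.ray import RayExecutor

    ray.init(ignore_reinit_error=True)
    settings = RayExecutor.create_settings(timeout_s=30)
    executor = RayExecutor(settings, num_workers=num_workers, use_gpu=use_gpu)
    executor.start()
    try:
        return executor.run(train_fn)  # one result per worker
    finally:
        executor.shutdown()


# Example (not executed here): size the job for the day's volume, then launch it.
# n = workers_for_volume(num_samples=2_500_000, samples_per_worker=1_000_000, max_workers=8)
# results = launch(n)
```

Because the worker count is recomputed per run, a day with little data for a region requests few actors and a heavy day requests more, up to the cap.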

Examples of Anomalies Detected I
- NDVI (Vegetation Index) data is used
- Validated as the Woolsey wildfire
