Slide 14
Slide 14 text
Problem - Dropping Data
● Data missing but stream watermark advancing anyways
○ Very surprising to discover this
○ Uncovered that transient errors are skipped over
○ No way for us to replay dropped data
● Beam & BigQuery retry options
○ All errors are considered transient except if BigQuery says that the error reason contains one of
ImmutableSet.of("invalid", "invalidQuery", "notImplemented")