Accelerating Spark at Microsoft using Gluten & Velox

Accelerating Spark at Microsoft using Gluten & Velox Microsoft Confidential

Who We Are Zhen Li Software Engineer Spark Runtime Team,
Fabric @Microsoft Swinky Mann Software Engineer

Outline • Introduction • MS Fabric and Internal Spark •
Integration with Gluten and Velox • Optimizing Performance • Conclusion Microsoft Confidential

Microsoft Fabric The data platform for the era of AI
Microsoft Confidential OneLake Data Factory Synapse Data Warehousing Synapse Real Time Analytics Power BI Synapse Data Engineering Synapse Data Science Data Activator

Microsoft Confidential Internal Spark (without Gluten & Velox) • Internal
Spark – Apache Spark + our optimizations. • 1 TB TPCDS - All 99 queries • Spark 3.4, ABFS, parquet • Lower is better. • Internal Spark 2x faster than Apache Spark. 100% 50%

Integration of Velox-Gluten • ABFS Support • Added ABFS (Azure
Blob Filesystem) storage adapter • OneLake integration, Auth, etc. • Support for Spark operators • Operators: Expand, BroadcastNestedLoopJoin, CartesianProduct, RollupHashAggregation. • 20+ Spark Functions: uuid, date_from_unix_date, from_utc_timestamp etc. • INT96/INT64 Timestamp in Velox parquet scan. • Spark scan with metadata columns in Gluten. Microsoft Confidential

Integration of Velox-Gluten • Reliability Improvement: • UT for Spark
3.3, Spark 3.4: 300+ UTs fixed from 40 suites. • Committed to making changes for Spark 3.5. • Delta Integration: • Support for Delta Update, Delete, Merge, Convert To Delta Commands • Reimplementation of unsupported UDFs to avoid fallbacks • Columnar implementation for Delta Optimized Write • Fallbacks for unsupported scenarios (Delta log checkpoint, Deletion Vectors) • Delta UTs coverage and testing. Microsoft Confidential

Optimizing Performance Microsoft Confidential

Optimizing Performance • Scan • Data Reading in Parallel: •
Concurrent Data Reading Support in the ‘preadv’. • Split preloading in Gluten. • Fabric intelligent cache integration. Microsoft Confidential

Optimizing Performance • Hash join & Hash aggregation • Avoid
re-computing normalized keys in HashTable::groupProbe (PR:6406 – query67). • Improve normalized key join probe (PR:6695 – query64). • Store duplicate row address in vector for join probe (PR:9079 – query72). Microsoft Confidential

Conclusion • Working towards making Gluten & Velox more reliable,
robust and performant. • Actively contributing to Gluten & Velox community. Microsoft Confidential

Thank You We are Hiring ! Microsoft Confidential

Accelerating Spark at Microsoft using Gluten & ...

Accelerating Spark at Microsoft using Gluten & Velox

Ali LeClerc

More Decks by Ali LeClerc

Other Decks in Technology

Featured

Transcript