Upgrade to Pro — share decks privately, control downloads, hide ads and more …

[Apache CommunityOverCode NA 2025] Becoming an ...

[Apache CommunityOverCode NA 2025] Becoming an Apache Airflow Committer from 0

https://communityovercode.org/past-sessions/community-over-code-na-2025/

How a Complete Beginner in Data Engineering / Junior Computer Science Student Became an Apache Airflow Committer in Just 5 Months—With 70+ PRs and 300 Hours of Contributions

This talk is aimed at those who are still hesitant about contributing to Apache Airflow. I hope to inspire and encourage anyone to take the first step and start their journey in open-source—let's build together!

Avatar for Zhe-You(Jason) Liu

Zhe-You(Jason) Liu

September 12, 2025

More Decks by Zhe-You(Jason) Liu

Other Decks in Programming

Transcript

  1. Table of Content 1. What is Apache Airflow? 2. Why

    Contribute to Open Source? 3. From First PR to Committer 4. Getting Started with Airflow Contributions
  2. Zhe-You(Jason) Liu • I’m from Taiwan 󰑐 • Apache Airflow

    Committer • LFX'25 Mentorship - CNCF Krkn • Upcoming Intern @Microsoft • NCKU CSIE Senior GitHub: jason810496 LinkedIn: zhe-you-liu
  3. • Most Common Data Orchestration Tool • 40k+ stars on

    GitHub • Used by over 70k companies ◦ including Apple, Uber, LinkedIn …
  4. Operator • PythonOperator • BashOperator • GCSToS3Operator • HiveToMySqlOperator •

    KubernetesPodOperator • BranchOperator • … 173 integrations
  5. What Problems Does Apache Airflow Solve? 1. Integration with Data

    Ecosystems 2. Retry-ability of Workflows 3. Workflow Observability Significantly reduce operational workload
  6. Why Apache Airflow? 1. Written in Python ! 2. Ranked

    as the 4th most popular project in the ASF 3. Over 40k stars on GitHub 4. Widely used in Data Engineering
  7. Benefits of Contributing to OSS • Coding skills ▪ Design

    Patterns, Unit Test, Code Review • System Level Design • Career Opportunities (?)
  8. Benefits of Contributing to OSS • Collaborate with top contributors

    over the world • Communication Skills • Design Documentation
  9. Resolve OOM when reading large task logs • This PR

    has been rewritten twice • It took half a year to get merged
  10. How to find Issues to Resolve? 1. Look for "Good

    First Issue" labels 2. Recent Issues and Pull Requests 3. Apache Airflow Slack 4. Airflow Improvement Proposals (AIPs) 5. Dev mailing list discussions
  11. Note for Using AI to Contribute • Short answer: Yes

    ! • Understand the context of the issue ourselves • Take responsibility for the generated content
  12. How to become a Committer? • Contribute consistently • Participate

    in discussions • Get noticed by the community Demonstrate your impact!
  13. OpenSource4You (源來適你) • Actively contributing to OSS from Taiwan •

    Each project has at least one mentor • Projects under ASF ◦ Airflow, Ambari, Datafusion ◦ Gravitino, Kafka, Ozone, YuniKorn • Flyte, KubeRay, Ray, Liger-Kernel, etc Facebook / GitHub / Slack: opensource4you