Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Public data repository and analysis pipeline fo...
Search
Tazro Inutano Ohta
July 11, 2014
Science
0
320
Public data repository and analysis pipeline for high-throughput sequencing
特定非営利活動法人酵母細胞研究会 第186回例会 次世代シーケンサーを活用した研究事例と、それを支える公共ツール・データベース
Tazro Inutano Ohta
July 11, 2014
Tweet
Share
More Decks by Tazro Inutano Ohta
See All by Tazro Inutano Ohta
Yevis: System to support building a workflow registry with automated quality control
inutano
0
110
Standardization of biological sample information database
inutano
0
70
Describe data analysis workflow with workflow languages
inutano
5
5.2k
Container virtualization technologies and workflow languages improve portability and reproducibility of data analysis environment
inutano
3
340
次世代シーケンサーによるメタゲノム解析:桜の花びらに付着した環境DNAを解析する
inutano
0
95
Workflows that run everywhere and where to run them
inutano
0
150
The Sequence Read Archive search system to make use of public high-throughput sequencing data
inutano
0
290
Improve portability of bioinformatics software across HPC and cloud infrastructures
inutano
1
110
Container, Cloud, and HPC
inutano
0
170
Other Decks in Science
See All in Science
07_浮世満理子_アイディア高等学院学院長_一般社団法人全国心理業連合会代表理事_紹介資料.pdf
sip3ristex
0
500
データマイニング - グラフデータと経路
trycycle
PRO
1
150
Explanatory material
yuki1986
0
330
データベース06: SQL (3/3) 副問い合わせ
trycycle
PRO
1
550
機械学習 - K近傍法 & 機械学習のお作法
trycycle
PRO
0
1.2k
2025-06-11-ai_belgium
sofievl
1
130
生成検索エンジン最適化に関する研究の紹介
ynakano
2
1.1k
地表面抽出の方法であるSMRFについて紹介
kentaitakura
1
760
Transport information Geometry: Current and Future II
lwc2017
0
160
データベース10: 拡張実体関連モデル
trycycle
PRO
0
720
04_石井クンツ昌子_お茶の水女子大学理事_副学長_D_I社会実現へ向けて.pdf
sip3ristex
0
510
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
220
Featured
See All Featured
Optimizing for Happiness
mojombo
379
70k
Music & Morning Musume
bryan
46
6.6k
Speed Design
sergeychernyshev
32
1k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
53k
Rebuilding a faster, lazier Slack
samanthasiow
82
9.1k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
2.9k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
34
3.1k
Git: the NoSQL Database
bkeepers
PRO
430
65k
GitHub's CSS Performance
jonrohan
1031
460k
The Art of Programming - Codeland 2020
erikaheidi
54
13k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
45
7.5k
Building Better People: How to give real-time feedback that sticks.
wjessup
367
19k
Transcript
࣍ੈγʔέϯαʔΛར༻ͨ͠ݚڀࣄྫͱͦΕΛࢧ͑Δެڞπʔϧɾσʔλϕʔε Public data repository and analysis pipeline for high-throughput sequencing
ใɾγεςϜݚڀػߏ ϥΠϑαΠΤϯε౷߹σʔλϕʔεηϯλʔ େా ୡ <
[email protected]
> ! prepared for ୈ186ճ ߬ࡉ๔ݚڀձ ྫձ July 11, 2014
Agenda ‣ %#$-4ͱ౷߹%#ϓϩδΣΫτʹ͍ͭͯ ‣ /(4ʹؔ࿈͢Δσʔλϕʔε ‣ /(4Λͬͨݚڀϑϩʔʹ͓͚Δެ։%#ͷׂ ‣ ެڞσʔλͷݕࡧ͔Βղੳ·Ͱ
DBCLSͱ౷߹σʔλϕʔεϓϩδΣΫτʹ͍ͭͯ Database Integration Project and DBCLS
DBCLSͱ౷߹σʔλϕʔεϓϩδΣΫτʹ͍ͭͯ ‣ େֶڞಉར༻ػؔ๏ਓใɾγεςϜݚڀػߏ 30*4 ࡿԼ ‣ +45ࡿԼͷ/#%$ ಉ͘͡30*4ࡿԼͷҨݚ%%#+ͱ࿈ܞ ‣ /#%$ϑΝϯσΟϯάɼ%%#+σʔλΞʔΧΠϒɼ%#$-4ٕज़։ൃΛ୲
ϥΠϑαΠΤϯε౷߹σʔλϕʔεηϯλʔ %#$-4 ɺੜ໋Պֶʹ͓͚Δ σʔλެ։ͷଅਐͱσʔλϕʔεߏஙʹࢿ͢Δٕज़ͷݚڀ։ൃΛߦ͏ݚڀॴͰ͢ɻ
http://dbcls.rois.ac.jp/about
DBCLSͱ౷߹ϓϩδΣΫτ: ͜Ε·Ͱʹ։ൃɾӡ༻͖ͯͨ͠αʔϏε ‣ *OUFHCJPσʔλϕʔεΧλϩά ‣ ੜ໋ՊֶσʔλϕʔεΞʔΧΠϒ ‣ ͦͷଞɼσʔλϕʔεԣஅݕࡧͳͲ%#౷߹ʹࢿ͢ΔαʔϏε ‣ ݸผʹݚڀ։ൃΛߦ͍ͬͯΔٕज़ͷԠ༻ͱͯ͠ͷαʔϏε
‣ UPHPHFOPNF ((3/" 3FG&Y ৽ணจϨϏϡʔ ౷߹57 *O.F9FT FUD
Database of Databases: Integbio DBcatalog http://integbio.jp/dbcatalog
ੜछΧςΰϦʹΑΔߜࠐ͕Մೳ http://integbio.jp/dbcatalog
DBͷҡ࣋ɼҾ͖ड͚·͢ http://dbarchive.biosciencedbc.jp/
ҰׅDLར༻ڐཧΛαϙʔτ http://dbarchive.biosciencedbc.jp/
Find more at http://biosciencedbc.jp
togogenome.org ggrna.dbcls.jp refex.dbcls.jp first.lifesciencedb.jp togotv.dbcls.jp docman.dbcls.jp/im
togogenome.org ggrna.dbcls.jp refex.dbcls.jp first.lifesciencedb.jp togotv.dbcls.jp docman.dbcls.jp/im ήϊϜใ/ՄࢹԽ ߴԘجྻݕࡧ ҨࢠൃݱϦϑΝϨϯε ຊޠจϨϏϡʔ
ಈըνϡʔτϦΞϧ จࣥචαϙʔτ
Find more at http://dbcls.rois.ac.jp/services
NGSʹؔ࿈͢Δσʔλϕʔε Data Repositories and Databases for high-throughput sequencing
NGSʹؔ࿈͢Δσʔλϕʔεɾެ։σʔλϨϙδτϦ ‣ ࠃࡍԘجྻσʔλϕʔεͱ4FRVFODF3FBE"SDIJWF ‣ ڊେϓϩδΣΫτʹΑΔσʔλϗεςΟϯά
ࠃࡍԘجྻσʔλϕʔεͱSequence Read Archive ‣ */4%$*OU`M/VDMFPUJEF4FRVFODF%BUBCBTF$PMMBCPSBUJPO ‣ /$#* &#* %%#+ہͷ୲νʔϜ͕ڞಉͰӡ༻ ‣
4FRVFODF3FBE"SDIJWF/(4ͷͨΊͷ1SJNBSZEBUBSFQP www.insdc.org
ڊେϓϩδΣΫτʹΑΔσʔλϗεςΟϯά ‣ نͷେ͖ͳϓϩδΣΫτͰࣗΒσʔλΛެ։͢Δ߹͕͋Δ ‣ (FOPNFT1SPKFDUIUUQHFOPNFTPSH ‣ 5IF$BODFS(FOPNF"UMBT1SPKFDUIUUQUDHBEBUBODJOJIHPW ‣ &/$0%&1SPKFDUIUUQHFOPNFVDTDFEVFODPEF ‣
σʔλͷίϐʔ͕ΫϥυαʔϏε্ʹެ։͞Ε͍ͯΔ͜ͱ ‣ HFOPNFTPO"84IUUQBXTBNB[PODPNHFOPNFT
σʔλͱσʔλϕʔεͷ֊ʹ͍ͭͯ Knowledge Summarised Data Experimental Data Knowledge-base Database Primary Data
Repository Biological Information “Database”
NGSʹؔ࿈͢Δσʔλϕʔε Knowledge-base Database Primary Data Repository
NGSʹؔ࿈͢Δσʔλϕʔε Knowledge-base Database Primary Data Repository
NGSΛͬͨݚڀϑϩʔʹ͓͚Δެ։DBͷׂ The role of database for each steps of sequencing
research procedure
ҰൠతͳNGSΛ༻͍ͨݚڀϑϩʔ ࣮ݧσβΠϯ ༧උ࣮ݧ αϯϓϦϯά DNAௐ ϥΠϒϥϦ࡞ γʔέϯε QC ϑΟϧλϦϯά alignment/assemble
QC తผղੳ ֬ೝ࣮ݧ σʔλެ։ จߘ ϦόΠζ/Ճ࣮ݧ ΞΫηϓτ ҿΈձ
ެڞσʔλϕʔε͕ؔΘΔεςοϓ ࣮ݧσβΠϯ ༧උ࣮ݧ αϯϓϦϯά DNAௐ ϥΠϒϥϦ࡞ γʔέϯε QC ϑΟϧλϦϯά alignment/assemble
QC తผղੳ ֬ೝ࣮ݧ σʔλެ։ จߘ ϦόΠζ/Ճ࣮ݧ ΞΫηϓτ ҿΈձ
ެڞσʔλΛར༻ͨ͠NGSݚڀͷσβΠϯ ‣ γʔέϯεલͷ࣮ݧσβΠϯ ‣ ྨࣅσʔλΛղੳ͢Δ͜ͱͰγʔέϯεޙͷྲྀΕΛςετ͢Δ ‣ γʔέϯεޙɺσʔλղੳͰ ‣ γʔέϯε݁ՌͷଥੑΛݕ౼͢Δ ‣
ࣗલͷσʔλͱൺֱղੳΛߦ͏ ‣ σʔλղੳޙɺՌൃදͷͰ ‣ σʔλΛϨϙδτϦʹެ։͢Δ
ެڞσʔλͷݕࡧ͔Βղੳ·Ͱ Search, Download, and Data Analysis of Public Sequencing Data
ެڞσʔλͷμϯϩʔυ͔Βղੳ·Ͱ ‣ ϨϙδτϦͷݕࡧػೳͰ୳͢ ‣ /$#* &#* %%#+ͷݕࡧΛར༻͢Δ ‣ σʔλͷ*%͕ࣄલʹ͔͍ͬͯΔ߹ʹ༗ޮ ‣
จ࣬ױͳͲͷؔ࿈ใ͔Β୳͢ ‣ %#$-443"Λར༻͢Δ ‣ ެڞͷղੳαʔϏεΛར༻ͯ͠ղੳ͢Δ ‣ %%#+3FBE"OOPUBUJPO1JQFMJOF ‣ .VEJ.VUBUJPO%JTDPWFSZJOZFBTU
ϨϙδτϦͷݕࡧػೳͷ͍ํ - github.com/inutano/sra_metadata_toolkit/wiki
DBCLS SRAΛར༻͢Δ - http://sra.dbcls.jp
DBCLS SRAΛར༻͢Δ - http://sra.dbcls.jp
จ͔Β୳͢ - http://sra.dbcls.jp/cgi-bin/publication.cgi
Ωʔϫʔυશจݕࡧ - http://sra.dbcls.jp/search
Ωʔϫʔυશจݕࡧ - http://sra.dbcls.jp/search
Ωʔϫʔυશจݕࡧ - http://sra.dbcls.jp/search
Ωʔϫʔυશจݕࡧ - http://sra.dbcls.jp/search
Ωʔϫʔυશจݕࡧ - http://sra.dbcls.jp/search
ެڞNGSղੳύΠϓϥΠϯ DDBJ Read Annotation Pipeline - http://p.ddbj.nig.ac.jp
ެڞNGSղੳύΠϓϥΠϯ DDBJ Read Annotation Pipeline - http://p.ddbj.nig.ac.jp
͍ํDDBJߨशձͰ (ࢿྉըެ։͞Ε͍ͯ·͢) http://www.ddbj.nig.ac.jp/ddbjing/
Mudi: Mutation discovery in yeast - http://naoii.nig.ac.jp/mudi_top.html
Mudi: Mutation discovery in yeast - http://naoii.nig.ac.jp/mudi_top.html
Summary ‣ %#$-4ͱ౷߹%#ϓϩδΣΫτࠃͷੜ໋ՊֶϦιʔεΛ උɾ౷߹͍ͯ͠·͢ ‣ ެڞ%#Ͱެ։͞ΕͨσʔλΛ༗ޮʹར༻͢Δ͜ͱͰ ݚڀϑϩʔͷޮԽΛਤΕ·͢ ‣ ݕࡧղੳʹެڞαʔϏεΛར༻͢Δ͜ͱͰ σʔλղੳͷίετԽ͕ਤΕ·͢
Thank you! ͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠ !
[email protected]
http://speakerdeck.com/inutano