Tazro Inutano Ohta
July 11, 2014
Public data repository and analysis pipeline for high-throughput sequencing
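Presented at the 186th regular meeting of 酵母細胞研究会 (Yeast Cell Research Society, a specified nonprofit corporation): "Research case studies using next-generation sequencers, and the public tools and databases that support them"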
Transcript
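Research case studies using next-generation sequencers, and the public tools and databases that support them: Public data repository and analysis pipeline for high-throughput sequencing
Tazro Ohta, Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS) <[email protected]>
Prepared for the 186th regular meeting of 酵母細胞研究会 (Yeast Cell Research Society), July 11, 2014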
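Agenda
‣ About DBCLS and the Database Integration Project
‣ Databases related to NGS
‣ The role of public databases in an NGS-based research workflow
‣ From searching public data to analysis
Database Integration Project and DBCLS
About DBCLS and the Database Integration Project
‣ DBCLS belongs to the Research Organization of Information and Systems (ROIS), an Inter-University Research Institute Corporation
‣ It works with NBDC, which is under JST, and with DDBJ at the National Institute of Genetics, which is also under ROIS
‣ NBDC is responsible for funding, DDBJ for data archiving, and DBCLS for technology development
The Database Center for Life Science (DBCLS) is a research institute that carries out research and development of technologies that promote data publication and support database construction in the life sciences.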
http://dbcls.rois.ac.jp/about
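DBCLS and the Integration Project: services developed and operated so far
‣ Integbio Database Catalog
‣ Life Science Database Archive
‣ Other services that support database integration, such as cross-database search
‣ Services that apply technologies researched and developed individually
  ‣ togogenome, GGRNA, RefEx, 新着論文レビュー (reviews of newly published papers), 統合TV (TogoTV), InMeXes, etc.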
Database of Databases: Integbio DBcatalog http://integbio.jp/dbcatalog
Can be filtered by species and category http://integbio.jp/dbcatalog
We take over the maintenance of databases http://dbarchive.biosciencedbc.jp/
Supports bulk download and clarification of terms of use http://dbarchive.biosciencedbc.jp/
Find more at http://biosciencedbc.jp
togogenome.org: genome information and visualization
ggrna.dbcls.jp: fast nucleotide sequence search
refex.dbcls.jp: gene expression reference
first.lifesciencedb.jp: paper reviews in Japanese
togotv.dbcls.jp: video tutorials
docman.dbcls.jp/im: support for writing papers
Find more at http://dbcls.rois.ac.jp/services
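Databases related to NGS: Data Repositories and Databases for high-throughput sequencing
Databases and public data repositories related to NGS
‣ The international nucleotide sequence databases and the Sequence Read Archive
‣ Data hosting by very large projects
The international nucleotide sequence databases and the Sequence Read Archive
‣ INSDC: Int'l Nucleotide Sequence Database Collaboration
‣ Operated jointly by the responsible teams at three sites: NCBI, EBI, and DDBJ
‣ Sequence Read Archive: the primary data repository for NGS
www.insdc.org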
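As an illustrative aside, Sequence Read Archive accessions carry a three-letter prefix: the first letter indicates which INSDC member issued the accession (S for NCBI, E for EBI, D for DDBJ) and the third letter indicates the object type. The following is a minimal Python sketch for decoding that prefix. The function name `describe_accession` and the accessions used with it are hypothetical, chosen only for illustration, and are not part of any INSDC tool or of the original slides.

```python
import re

# First letter: the INSDC member that issued the accession.
ARCHIVES = {"S": "NCBI", "E": "EBI", "D": "DDBJ"}

# Third letter: the type of SRA object the accession refers to.
OBJECT_TYPES = {
    "A": "submission",
    "P": "study",
    "X": "experiment",
    "S": "sample",
    "R": "run",
}

# Archive letter, a literal "R", object-type letter, then six or more digits.
ACCESSION_PATTERN = re.compile(r"^([SED])R([APXSR])(\d{6,})$")


def describe_accession(accession: str) -> tuple[str, str]:
    """Return (issuing archive, object type) for an SRA-style accession.

    Hypothetical helper for illustration; raises ValueError when the
    string does not look like an SRA accession.
    """
    match = ACCESSION_PATTERN.match(accession.strip().upper())
    if match is None:
        raise ValueError(f"not an SRA-style accession: {accession!r}")
    archive_letter, type_letter, _digits = match.groups()
    return ARCHIVES[archive_letter], OBJECT_TYPES[type_letter]
```

Because the three archives exchange data, the prefix says where a record was submitted, not where it can be downloaded.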
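Data hosting by very large projects
‣ Large-scale projects sometimes publish their data themselves
  ‣ 1000 Genomes Project: http://1000genomes.org
  ‣ The Cancer Genome Atlas Project: http://tcga-data.nci.nih.gov
  ‣ ENCODE Project: http://genome.ucsc.edu/encode
‣ Copies of the data are sometimes published on cloud services
  ‣ 1000 genomes on AWS: http://aws.amazon.com/1000genomes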
The hierarchy of data and databases
[Figure: the levels Knowledge, Summarised Data, and Experimental Data shown with Knowledge-base, Database, and Primary Data Repository; labeled Biological Information “Database”]
Databases related to NGS: Knowledge-base, Database, Primary Data Repository
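The role of public databases in an NGS-based research workflow: The role of databases at each step of the sequencing research procedure
A typical NGS-based research workflow: experimental design → preliminary experiments → sampling → DNA preparation → library preparation → sequencing → QC → filtering → alignment/assembly → QC → purpose-specific analysis → validation experiments → data release → manuscript submission → revision/additional experiments → acceptance → celebratory drinks
Steps where public databases are involved: experimental design → preliminary experiments → sampling → DNA preparation → library preparation → sequencing → QC → filtering → alignment/assembly → QC → purpose-specific analysis → validation experiments → data release → manuscript submission → revision/additional experiments → acceptance → celebratory drinks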
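To make the QC and filtering steps of this workflow concrete, here is a minimal Python sketch that drops reads whose mean base quality falls below a threshold. It is not the method used in the talk. It assumes four-line FASTQ records with Phred+33 quality encoding, and the names `parse_fastq`, `mean_phred`, and `filter_reads` as well as the threshold of 20 are hypothetical choices for illustration.

```python
from typing import Iterable, Iterator, List, NamedTuple


class FastqRecord(NamedTuple):
    name: str
    sequence: str
    quality: str


def parse_fastq(lines: Iterable[str]) -> Iterator[FastqRecord]:
    """Yield records from four-line FASTQ text (sequences must not be wrapped)."""
    stripped = [line.rstrip("\n") for line in lines if line.strip()]
    for i in range(0, len(stripped) - 3, 4):
        header, sequence, _plus, quality = stripped[i:i + 4]
        if not header.startswith("@") or len(sequence) != len(quality):
            raise ValueError(f"malformed FASTQ record starting at non-blank line {i + 1}")
        yield FastqRecord(header[1:], sequence, quality)


def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string, assuming Phred+33 encoding."""
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def filter_reads(records: Iterable[FastqRecord],
                 min_mean_quality: float = 20.0) -> List[FastqRecord]:
    """Keep non-empty reads whose mean Phred score reaches the threshold."""
    return [
        record for record in records
        if record.quality and mean_phred(record.quality) >= min_mean_quality
    ]


if __name__ == "__main__":
    example = "@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\n!!!!\n"
    for read in filter_reads(parse_fastq(example.splitlines())):
        print(read.name, mean_phred(read.quality))
```

Real pipelines usually also trim adapters and low-quality tails; a mean-quality cutoff is only the simplest possible filter.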
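Designing NGS research that makes use of public data
‣ Before sequencing, at the experimental design stage
  ‣ Test the post-sequencing workflow by analyzing similar data
‣ After sequencing, during data analysis
  ‣ Check the validity of the sequencing results
  ‣ Run comparative analyses against your own data
‣ After data analysis, at the stage of presenting results
  ‣ Publish the data in a repository
From searching public data to analysis: Search, Download, and Data Analysis of Public Sequencing Data
From downloading public data to analysis
‣ Search with the repositories' search functions
  ‣ Use the search services of NCBI, EBI, and DDBJ
  ‣ Effective when the data ID is known in advance
‣ Search from related information such as publications or diseases
  ‣ Use DBCLS SRA
‣ Analyze with public analysis services
  ‣ DDBJ Read Annotation Pipeline
  ‣ Mudi: Mutation Discovery in yeast
How to use the repositories' search functions - github.com/inutano/sra_metadata_toolkit/wiki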
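One way to query a repository programmatically when an ID or keyword is already known is the NCBI E-utilities `esearch` endpoint with `db=sra`. The sketch below only builds the request URL and sends nothing over the network. It is not taken from the sra_metadata_toolkit wiki; the helper name `build_sra_search_url` and the accession `DRP000001` are hypothetical placeholders.

```python
from urllib.parse import urlencode

# NCBI E-utilities ESearch endpoint.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_sra_search_url(term: str, max_results: int = 20) -> str:
    """Build an ESearch URL that searches the SRA database for `term`.

    Hypothetical helper for illustration: it returns the URL as a string
    and leaves fetching and parsing the response to the caller.
    """
    params = {
        "db": "sra",            # search the Sequence Read Archive
        "term": term,           # accession or free-text query
        "retmax": max_results,  # maximum number of IDs to return
        "retmode": "json",      # ask for a JSON response
    }
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    # Placeholder accession; replace with a real study or run ID.
    print(build_sra_search_url("DRP000001"))
```

The returned URL can be opened in a browser or fetched with any HTTP client. NCBI asks heavy users to rate-limit their requests and to identify themselves with an API key.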
Using DBCLS SRA - http://sra.dbcls.jp
Searching from publications - http://sra.dbcls.jp/cgi-bin/publication.cgi
Full-text keyword search - http://sra.dbcls.jp/search
Public NGS analysis pipeline: DDBJ Read Annotation Pipeline - http://p.ddbj.nig.ac.jp
How to use it is taught at the DDBJ training courses (materials and videos are also available) http://www.ddbj.nig.ac.jp/ddbjing/
Mudi: Mutation discovery in yeast - http://naoii.nig.ac.jp/mudi_top.html
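Summary
‣ DBCLS and the Database Integration Project maintain and integrate Japan's life science resources
‣ Making effective use of data released in public databases can make the research workflow more efficient
‣ Using public services for search and analysis can reduce the cost of data analysis
Thank you for your attention!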
[email protected]
http://speakerdeck.com/inutano