Public data repository and analysis pipeline for high-throughput sequencing
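186th regular meeting of the nonprofit organization 酵母細胞研究会 (Yeast Cell Research Society): research case studies using next-generation sequencers, and the public tools and databases that support them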
Tazro Inutano Ohta
July 11, 2014
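Transcript
Research case studies using next-generation sequencers, and the public tools and databases that support them
Public data repository and analysis pipeline for high-throughput sequencing
Tazro Ohta, Database Center for Life Science (DBCLS), Research Organization of Information and Systems (ROIS)
Prepared for the 186th regular meeting of 酵母細胞研究会 (Yeast Cell Research Society), July 11, 2014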
Agenda
‣ DBCLS and the Database Integration Project
‣ Databases related to NGS
‣ The role of public databases in an NGS-based research workflow
‣ Public data: from search to analysis
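Database Integration Project and DBCLS
‣ DBCLS belongs to the Research Organization of Information and Systems (ROIS), an Inter-University Research Institute Corporation
‣ It works with NBDC, which is under JST, and with DDBJ at the National Institute of Genetics, which is also under ROIS
‣ NBDC is responsible for funding, DDBJ for data archiving, and DBCLS for technology development
The Database Center for Life Science (DBCLS) is a research institute that develops technologies to promote data sharing and to support database construction in the life sciences.
http://dbcls.rois.ac.jp/about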
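DBCLS and the Integration Project: services developed and operated so far
‣ Integbio Database Catalog
‣ Life Science Database Archive
‣ Other services that support database integration, such as cross-database search
‣ Services that apply technologies from individual research and development projects
  ‣ togogenome, GGRNA, RefEx, 新着論文レビュー (reviews of newly published papers), 統合TV (TogoTV), InMeXes, etc.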
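Database of Databases: Integbio DBcatalog http://integbio.jp/dbcatalog
Entries can be filtered by species and by category http://integbio.jp/dbcatalog
We take over the maintenance of your database http://dbarchive.biosciencedbc.jp/
Supports bulk download and the clarification of terms of use http://dbarchive.biosciencedbc.jp/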
Find more at http://biosciencedbc.jp
togogenome.org: genome information and visualization
ggrna.dbcls.jp: fast nucleotide sequence search
refex.dbcls.jp: gene expression reference
first.lifesciencedb.jp: reviews of papers, written in Japanese
togotv.dbcls.jp: video tutorials
docman.dbcls.jp/im: support for writing papers
Find more at http://dbcls.rois.ac.jp/services
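Data Repositories and Databases for high-throughput sequencing
Databases and public data repositories related to NGS
‣ The international nucleotide sequence databases and the Sequence Read Archive
‣ Data hosting by large-scale projects
The international nucleotide sequence databases and the Sequence Read Archive
‣ INSDC: Int'l Nucleotide Sequence Database Collaboration
‣ Operated jointly by the teams in charge at NCBI, EBI, and DDBJ
‣ Sequence Read Archive: the primary data repo for NGS
www.insdc.org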
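Because the three INSDC partners share one archive, an SRA accession number itself shows where a record was submitted and what kind of object it is. The following Python sketch is not from the slides. It assumes the usual accession layout: an archive letter (S for NCBI, E for EBI, D for DDBJ), the letter R, a type letter, then digits. The function name `describe_accession` is hypothetical.

```python
import re

# Assumption: accessions follow the layout <archive letter>R<type letter><digits>,
# e.g. DRR000001. The names below are illustrative and not part of any library.
ARCHIVES = {"S": "NCBI", "E": "EBI", "D": "DDBJ"}
OBJECT_TYPES = {
    "A": "submission",
    "P": "study",
    "X": "experiment",
    "S": "sample",
    "R": "run",
}
ACCESSION_RE = re.compile(r"^([SED])R([APXSR])(\d+)$")


def describe_accession(accession):
    """Return the accepting archive and object type, or None if the text
    does not look like an SRA accession."""
    match = ACCESSION_RE.match(accession.strip().upper())
    if match is None:
        return None
    archive_letter, type_letter, _digits = match.groups()
    return {
        "archive": ARCHIVES[archive_letter],
        "type": OBJECT_TYPES[type_letter],
    }


if __name__ == "__main__":
    for acc in ["DRR000001", "SRP012345", "ERX123456", "not-an-id"]:
        print(acc, describe_accession(acc))
```

A check like this is only a convenience for sorting IDs found in papers. The repositories themselves remain the authority on whether an accession exists.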
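Data hosting by large-scale projects
‣ Large projects sometimes publish their data themselves
  ‣ 1000 Genomes Project http://1000genomes.org
  ‣ The Cancer Genome Atlas Project http://tcga-data.nci.nih.gov
  ‣ ENCODE Project http://genome.ucsc.edu/encode
‣ Copies of the data are sometimes published on cloud services
  ‣ 1000 genomes on AWS http://aws.amazon.com/1000genomes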
The hierarchy of data and databases
[Diagram: Knowledge, Summarised Data, and Experimental Data, shown alongside Knowledge-base, Database, and Primary Data Repository, which together make up the biological information "Database"]
Databases related to NGS
[Diagram: Knowledge-base, Database, Primary Data Repository]
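The role of public databases in an NGS-based research workflow
The role of databases in each step of the sequencing research procedure
A typical NGS-based research workflow
experimental design → preliminary experiments → sampling → DNA preparation → library preparation → sequencing → QC → filtering → alignment/assembly → QC → purpose-specific analysis → validation experiments → data release → paper submission → revision/additional experiments → acceptance → drinking party
Steps in which public databases are involved: the same workflow, from experimental design through acceptance
Designing NGS research with public data
‣ Experimental design before sequencing
  ‣ Test the post-sequencing workflow by analyzing similar data
‣ During data analysis after sequencing
  ‣ Assess the validity of the sequencing results
  ‣ Run comparative analyses against your own data
‣ At the publication stage after data analysis
  ‣ Publish the data in a repository
Search, Download, and Data Analysis of Public Sequencing Data
Public data: from download to analysis
‣ Search with the repositories' own search functions
  ‣ Use the search provided by NCBI, EBI, or DDBJ
  ‣ Effective when the data ID is known in advance
‣ Search from related information such as publications or diseases
  ‣ Use DBCLS SRA
‣ Analyze with public analysis services
  ‣ DDBJ Read Annotation Pipeline
  ‣ Mudi: Mutation Discovery in yeast
How to use the repositories' search functions - github.com/inutano/sra_metadata_toolkit/wiki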
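When the data ID is already known, a repository search can also be scripted. The sketch below is an assumption layered on top of the slides, which do not name a programmatic interface. It builds a query URL for NCBI's E-utilities `esearch` endpoint against the `sra` database and performs no network access. The helper name `build_sra_search_url` and the default `retmax` value are hypothetical.

```python
from urllib.parse import urlencode

# Assumption: NCBI E-utilities ESearch is used as the programmatic search
# interface; the slides only refer to the repositories' search functions.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_sra_search_url(term, retmax=20):
    """Build an ESearch URL that looks up `term` (an accession or keywords)
    in the SRA database and asks for a JSON response."""
    params = {
        "db": "sra",         # search the Sequence Read Archive
        "term": term,        # e.g. a known run accession such as DRR000001
        "retmode": "json",   # request JSON instead of the default XML
        "retmax": retmax,    # maximum number of record IDs to return
    }
    return EUTILS_ESEARCH + "?" + urlencode(params)


if __name__ == "__main__":
    print(build_sra_search_url("DRR000001"))
    print(build_sra_search_url("Saccharomyces cerevisiae", retmax=5))
```

The resulting URL can be opened in a browser or fetched with any HTTP client. EBI and DDBJ provide their own search services, which this sketch does not cover.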
Using DBCLS SRA - http://sra.dbcls.jp
Search by publication - http://sra.dbcls.jp/cgi-bin/publication.cgi
Keyword full-text search - http://sra.dbcls.jp/search
Public NGS analysis pipeline: DDBJ Read Annotation Pipeline - http://p.ddbj.nig.ac.jp
How to use it is covered in the DDBJ training courses (materials and videos are also publicly available) http://www.ddbj.nig.ac.jp/ddbjing/
Mudi: Mutation discovery in yeast - http://naoii.nig.ac.jp/mudi_top.html
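Summary
‣ DBCLS and the Database Integration Project organize and integrate Japan's life science resources
‣ Making effective use of data released in public databases can make the research workflow more efficient
‣ Using public services for search and analysis can lower the cost of data analysis
Thank you for your attention!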
http://speakerdeck.com/inutano