output is approximately 0.5 to 1 TB per dataset
• intermediate files are of a similar order of size
• too large
• Automation
  • in previous work, I did everything by hand
  • fast for only 50 miRNAs
  • exhausting and error-prone when processing so many datasets
  • will be ported to the Galaxy framework in the future

Bioinformatics and Biostatistics Core, NTU Center of Genomic Medicine