
Nuts and bolts—how to prepare and deal with big data in next-gen sequencing

Monday, October 24, 2011 — Poster Session I

Noon – 2:00 p.m.

Natcher Conference Center

NLM

BIOINFO-2

Authors

  • D Beloslyudtsev
  • C Cope
  • D Preuss

Abstract

As the quantity of data from next-generation sequencing continues to grow, previously simple tasks, such as copying a file from one lab to another or submitting a data set to NCBI, have become much more difficult. We discuss the methods and pitfalls we have observed in transferring large datasets; many of these lessons were previously learned in the high-performance and large-scale networking community. Researchers must pay particular engineering attention to what is traditionally IT infrastructure, and failure to do so can leave a researcher without viable access to data. We discuss high-speed networks, high-performance storage, network firewalls, and connectivity, and evaluate the importance of each to the success of next-gen sequencing.
