High-performance Integrated Virtual Environment (HIVE) for Next-Generation Sequencing Analysis Infrastructure

Friday, November 08, 2013 — Poster Session III

10:00 a.m. – 12:00 p.m.

FAES Academic Center (Upper-Level Terrace)

FDA/CBER

GEN-34

Authors

  • Alin Voskanian
  • Anton Golikov
  • Raja Mazumder
  • Vahan Simonyan

Abstract

The High-performance Integrated Virtual Environment (HIVE) is a cloud-based environment optimized for the storage and analysis of extra-large data. It provides web access for authorized users to deposit, retrieve, annotate, and compute on high-throughput sequencing (HTS) data, and to analyze the outcomes in web-based visual environments built in collaboration with research scientists and regulatory personnel. HIVE has a multicomponent cloud infrastructure comprising a distributed storage library and a highly parallelized distributed computational powerhouse. This infrastructure allows researchers and their collaborators to perform HTS analyses using algorithms available within the environment. These algorithms support alignment of next-generation sequencing reads to reference genomes, base calling and SNP calling, assessment of post-alignment quality control using positional base-frequency entropy, analysis of metagenomic recombination, clonal population discovery, and many others. All of these utilities are organized within the HIVE interface, which grants users access to sequence datasets, user files, algorithmic utilities, analysis pipelines, and computation results. The interface also supports self-registration web pages while maintaining an authentication and supervisor-confirmation notification system, which manages account registration confirmation emails and lets group administrators verify joining members.
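
As a rough illustration of the positional base-frequency entropy idea named in the abstract, the sketch below computes the Shannon entropy of the base distribution at each position of a set of equal-length aligned reads. It is not HIVE's implementation: the function name `positional_entropy`, the input format (a list of strings), and the choice to ignore non-ACGT characters are all assumptions made for this example.

```python
import math
from collections import Counter


def positional_entropy(reads):
    """Return the Shannon entropy (in bits) of base frequencies per position.

    Hypothetical helper for illustration only. `reads` is assumed to be a
    list of equal-length, already-aligned sequences given as strings.
    """
    if not reads:
        return []
    length = len(reads[0])
    if any(len(r) != length for r in reads):
        raise ValueError("all aligned reads must have the same length")

    entropies = []
    # zip(*reads) walks the alignment column by column.
    for column in zip(*reads):
        # Assumption: count only unambiguous bases; skip gaps and N.
        counts = Counter(b for b in column if b in "ACGT")
        total = sum(counts.values())
        if total == 0:
            entropies.append(0.0)
            continue
        h = -sum((n / total) * math.log2(n / total) for n in counts.values())
        entropies.append(h)
    return entropies


if __name__ == "__main__":
    example = ["ACGT", "ACGA", "ACTC", "AGTG"]
    for pos, h in enumerate(positional_entropy(example)):
        print(pos, round(h, 3))
```

Under these assumptions, a position where every read agrees has entropy near zero, while a position with all four bases equally represented reaches the maximum of 2 bits. High-entropy positions may therefore point to either real variation or alignment noise.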
