The Korea Bioinformation Center (KOBIC) provides the Bio-Express large-scale Data Analysis Cloud service for those researchers who need large-scale analysis servers or analysis technology.
The Bio-Express is comprised of
- A big data platform for efficient storage, management and utilization of large-scale bio data,
- CLOSHA Integrated Automatic Analysis System with user-friendly interface and analysis environment,
- GBox High-speed Transmission System that transmits a large amount of data at a fast rate. The big data platform built on the in-house technologies and the Hadoop Distributed File System (HDFS) enables users to use both popular analysis programs and the Hadoop-based big data analysis program simultaneously. We also provide a variety of public data, including 1,000 pieces of genomes data and TCGA data to keep researchers updated with the latest public genome data.
Bio-Express Configuration
BIO-EXPRESS
Bio-Express is a Hadoop-based high-performance infrastructure that consists of high-capacity data high-speed analysis systems and high-speed transmission systems.
CLOSHA
Workflow-based integrated automated analytics system provides an efficient analytics environment
GBox
High-speed transmission of large-scale bioin-
formation Outstanding reliability and stability
User-friendly interface Efficient use of network
bandwidths Retransmissions / Transmission history log data
Why Bio-Express?
- High-performance : Providing high-speed / high-capacity analysis service using high-performance infrastructure system
- Stability : Outstanding reliability and stability
- Hybrid : Simultaneous use of regular analysis programs and Hadoop-based big data analysis program
- Visualization : Ability to download and visualize analysis results
- Convenience : Convenient user interface based on drag & drop
- Monitoring : Ability to monitor the pipeline execution status and results
CLOSHA Integrated Automatic Analysis System
- Perform analysis works through simple workflow modeling
- Uses-friendly interface based on Drag & Drop
- Simultaneous use of regular analysis programs and Hadoop-based big data analysis program
- Ability to download and visualize analysis results
- Ability to monitor the pipeline execution status and results
- Providing a variety of analysis programs/pipelines
GBox High-speed Transmission System
- High-speed transmission of large-scale bioinformation
- Outstanding reliability and stability
- Use-friendly interface
- Efficient use of network bandwidths
- Transmission / Retransmissions history log data
User
Researcher, Government research institution,
Companies, Hospital, Etc
Analysis Service
- ㆍRNA-Sequencing
- ㆍExome Sequencing
- ㆍEpigenomics
- ㆍMetagenomics
- ㆍMetagenomics
- ㆍGWAS
Big Data Analysis Platform
- ㆍHigh-speed Transmission System
- - Data fast compression transfer
- capability
- - Encryption transfer capabilities
- - Various convenience features
- ㆍWorkflow-based
- ㆍWeb platform analysis service
- ㆍCLOSHA
- ㆍIntegrated infrastructure based
- on big data
Large scale storage
- ㆍUser data storage and backup
- ㆍEnhanced user data security
- ㆍAuthorized user data sharing
- ㆍHadoop Distributed File System
- - High-availability and high-efficiency distributed file system construction using off-the-shelf hardware
Hybrid-Cloud Infrastructure
- Strong sercurity policy
- HPC Cluster
- Big data platform
Configuration
User
SSO(Login System)
CLOSHA
KOBIC Web-based Service
- ㆍCloud Analysis Service
- ㆍBio-Data
- ㆍKOBIS
- ㆍVarious Web Analysis Service
HDFS
HDFS File Storage
- ㆍHigh Data Throughput
- ㆍLarge Scale Data Storage Capability
- ㆍSupport for a variety of data types
- ㆍData redundancy integrity
GBox
GBoX
- ㆍEnsuring Fast File Transfer
- ㆍEfficient Bandwidth Utilization
- ㆍProvide a Secure Environment
- ㆍProvides Data Reliability
Big Data Infrastructure Systems
- Built with 100% in-house technology
- Apache Hadoop based big data analysis platform construction
- Service provided after verfication of analysis program
- Maximize performance using integrated resource management solution
- Integration of major analysis programs and hetero-geneous execution environments
- Provides analysis pipeline in various fields
- Provide high-speed analysis infrastructure system
빅데이터 인프라 시스템
빅데이터 인프라 시스템안내로 클라우드 서비스, 플랫폼 기술, 빅데이터 플랫폼을 안내합니다.
Cloud Service |
클라우드 기반 Bio-Express 분석 서비스
클라우드 기반 Bio-Express 분석 서비스를 안내합니다.
Convenient Analytical |
High Speed Transport |
Strong Security |
High Speed Analysis |
Cloud-based Bio-Express Analysis Service
- Analysis Workflow Editor
CLOSHA
- High-speed Transmission System
GBoX
|
|
Big Data Platform |
Workload management and task management |
YARN
SGE
|
Large Scale Data Storage
- ㆍHigh Scalability And Availability
- ㆍLarge Sclae Storage Capability
- ㆍSupport for various data types
- ㆍData redundancy provides integrity
|
File System HDFS
Cache File System SSD & Lustre
|
HPC Cluster System |
Apache Hadoop
InfiniBand Network
|