The RDS Omics project is building a platform of cloud-based data services and tools for Australian researchers so they combine “multi-omics” data (genomic, transcriptomic, proteomic and metabolomic data, and then analyse and interpret the combined data.
The platform is composed of three major components:
1. The data management platform (DMP),
2. GenomeSpace and
The figure below shows a high level view of how they are interconnected in terms of data flows.
From the view of a user, the three are used together as follows in terms of a simple workflow:
A – A biologist logs into the DMP portal and they query for a type of data in the platform. For example, they might request all genomics fastq files acquired in a specific time frame for the streptococcus pyogenes bacteria.
B – The DMP then sends the resultant datasets to the user’s GenomeSpace portal. The flexibility of GenomeSpace allows the user to import other data from other sources into GenomeSpace if they wish.
C – The user then analyses data using the microbial-GVL. Alternatively, as GenomeSpace is integrated with many other highly-used analysis and visualisation tools, users can also choose to analyse on or visualise their data outside of the microbial GVL.
D – Results of analysis are sent back to GenomeSpace portal, where they can be shared or published. Analyses can be recorded as workflows for the microbial-GVL and saved as publishable, reproducible entities in GenomeSpace.