Benefits

From Possum

Jump to: navigation, search

What will be the benefits of a file-based data standard?

Exchange
Using a unified data standard ensures the correct exchange of data between software packages. Different flat data files with little tagging information is sensitive to reading errors by software, resulting in error messages or even is regarded incorrect input of data! Each file in the POF (Portable Omics Format) format will contain an interface readable self-description of data types and layout. For software packages requiring different data types (e.g. QTL mapping) all data types are provided in a single file, rather than several separate files.


Wrapping and storage
A universal file format enables the clustering of limited data sets which have something in common. This can be all data generated within the context of a project, data collected from one or several databases with a certain goal or the raw data from published results. These data can be stored or distributed in a POF and maintain their project context.


Visualization
Data from a POF file can be visualized using the freely available omics reader software. For each defined data type or combination of data types, specific plug-ins (developed by either the POS∑ community or third parties) will provide visualization. Some examples of genetical visualization tools.


Additional features
The binary basis of the XML structured POF format enables additional features that will be implemented such as:

  • Data compression
  • Data signing
  • Data encryption
  • Storage of meta data
  • Storage of (analysis) process steps
  • Roll-back mechanisms in analysis steps


Applications
The products of the POS∑ project can be used in a wide range of applications and environments:

   - secure and consistent data exchange between software
   - data exchange between databases
   - saving combined results of database querying
   - systematic storage of all project data during phase of project carrying out
   - exchange of data in the academic world
   - secured and easy distribution of research service data sets in commercial environments


Personal tools