Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
List of file formats
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== Scientific data (data exchange) == * [[FITS]] β Flexible Image Transport System, a standard data format for astronomy * [[Silo (library)|Silo]] β a storage format for visualization developed at [[Lawrence Livermore National Laboratory]] * SPC β [[SPC file format|SPC]], spectroscopic data * [[EAS3]] β binary format for structured data * [[EOSSA]] β Electro-Optic Space Situational Awareness format * OST β (Open Spatio-Temporal) extensible, mainly images with related data, or just pure data; meant as an open alternative for microscope images * CCP4 β [[CCP4 (file format)|CCP4]], X-ray crystallography voxels (electron density) * MRC β [[MRC (file format)|MRC]], voxels in cryo-electron microscopy * [[HITRAN]] β spectroscopic data with one optical/infrared transition per line in the ASCII file (.hit) * ROOT β hierarchical platform-independent compressed binary format used by [[ROOT]] * SDF β [[Simple Data Format|Simple Data Format (SDF)]], a platform-independent, precision-preserving binary data I/O format capable of handling large, multi-dimensional arrays. * MYD β Everfine LEDSpec software file for LED measurements * CSDM β (Core Scientific Dataset Model) model for multi-dimensional and correlated datasets from various spectroscopies, diffraction, microscopy, and imaging techniques (.csdf, .csdfe).<ref> {{cite journal |last1= Srivastava |first1= Deepansh |last2= Vosegaard|first2=Thomas | last3 = Massiot | first3 = Dominique | last4 = Grandinetti | first4 = Philip | author-link4=Philip Grandinetti |date=January 2020 |title=Core Scientific Dataset Model: A lightweight and portable model and file format for multi- dimensional scientific data |journal= PLOS ONE |volume= 15 |issue=1 |pages= e0225953 |doi=10.1371/journal.pone.0225953 |pmid= 31895936 |pmc= 6940021 |bibcode= 2020PLoSO..1525953S |doi-access= free }} </ref> === Multi-domain === * [[NetCDF]] β Network common data format * HDR, HDF, h4, h5 β [[Hierarchical Data Format]] * SDXF β [[SDXF]], (Structured Data Exchange Format) * CDF β [[Common Data Format]] * CGNS β [[CGNS]], CFD General Notation System * [https://dx.doi.org/10.1016/j.cpc.2009.11.014 FMF] β Full-Metadata Format === Meteorology === * [[GRIB]] β Grid in Binary, WMO format for weather model data * [[BUFR]] β WMO format for weather observation data * [[PP-format|PP]] β UK Met Office format for weather model data * NASA-Ames β Simple text format for observation data. First used in aircraft studies of the atmosphere. === Chemistry === {{Main|chemical file format}} * CML β [[Chemical Markup Language]] (CML) (.cml) * MOL, SD, SDF β [[Chemical table file]] (CTab) * DX, JDX β [[Joint Committee on Atomic and Molecular Physical Data]] (JCAMP) * SMI β [[Simplified molecular input line entry specification]] (SMILES) === Mathematics === * G6, S6 β graph6, sparse6, ASCII encoding of [[Adjacency matrix|Adjacency matrices]] === Biology === Molecular biology and bioinformatics: * AB1 β In [[DNA sequencing]], [[chromatogram]] files used by instruments from [[Applied Biosystems]] * ACE β A [[sequence assembly]] format * ASN.1 β [[Abstract Syntax Notation One]], is an International Standards Organization ([[International Organization for Standardization|ISO]]) data representation format used to achieve interoperability between platforms. [[National Center for Biotechnology Information|NCBI]] uses ASN.1 for the storage and retrieval of data such as nucleotide and protein sequences, structures, genomes, and PubMed records. * BAM β [[Binary Alignment/Map]] format (compressed SAM format) * BCF β Binary compressed VCF format * BED β The [[BED file format|browser extensible display format]] is used for describing [[gene]]s and other features of [[DNA]] sequences * CAF β Common Assembly Format for [[sequence assembly]] * [[CRAM (file format)|CRAM]] β compressed file format for storing biological sequences aligned to a reference sequence * DDBJ β The flatfile format used by the [[DNA Data Bank of Japan|DDBJ]] to represent database records for [[nucleotide sequence|nucleotide]] and [[peptide sequence]]s from [[DDBJ]] databases. * EMBL β The flatfile format used by the [[European Molecular Biology Laboratory|EMBL]] to represent database records for [[nucleotide sequence|nucleotide]] and [[peptide sequence]]s from [[EMBL]] databases. * FASTA β The [[FASTA format]], for sequence data. Sometimes also given as FNA or FAA (Fasta Nucleic Acid or Fasta Amino Acid). * FASTQ β The [[FASTQ format]], for sequence data with quality. Sometimes also given as QUAL. * GCPROJ β The [[Genome Compiler]] project. Advanced format for genetic data to be designed, shared and visualized. * GenBank β The flatfile format used by the [[National Center for Biotechnology Information|NCBI]] to represent database records for [[nucleotide sequence|nucleotide]] and [[peptide sequence]]s from the [[GenBank]] and [[RefSeq]] databases * GFF β The [[General feature format]] is used to describe [[gene]]s and other features of [[DNA]], [[RNA]], and [[protein]] sequences * GTF β The [[Gene transfer format]] is used to hold information about [[gene]] structure * MAF β The [[Multiple Alignment Format]] stores multiple alignments for whole-genome to whole-genome comparisons [https://biopython.org/wiki/Multiple_Alignment_Format] * NCBI β Structured [[ASN.1]] format used at [[National Center for Biotechnology Information]] for DNA and protein data * NEXUS β The [[Nexus file]] encodes mixed information about genetic sequence data in a block structured format * NeXML β XML format for [[phylogenetic tree]]s * NWK β The [[Newick format|Newick tree format]] is a way of representing graph-theoretical trees with edge lengths using parentheses and commas and useful to hold [[phylogenetic tree]]s. * PDB β structures of biomolecules deposited in [[Protein Data Bank]], also used to exchange protein and nucleic acid structures * PHD β Phred output, from the base-calling software [[Phred (software)|Phred]] * PLN β Protein Line Notation used in [http://www.biochemfusion.com/products/ proteax software] [http://www.biochemfusion.com/doc/Biochemfusion_PLN_1.4_spec.pdf specification] * SAM β [[SAM (file format)|SAM]], Sequence Alignment Map format, in which the results of the [[1000 Genomes Project]] will be released * SBML β [[SBML|The Systems Biology Markup Language]] is used to store biochemical network computational models * SCF β Staden chromatogram files used to store data from [[DNA sequencing]] * SFF β [[Standard Flowgram Format]] * SRA β format used by the [[National Center for Biotechnology Information]] Short Read Archive to store high-throughput DNA sequence data * Stockholm β The [[Stockholm format]] for representing [[multiple sequence alignment]]s * Swiss-Prot β The flatfile format used to represent database records for [[peptide sequence|protein]] sequences from the [[Swiss-Prot]] database * VCF β [[Variant Call Format]], a standard created by the [[1000 Genomes Project]] that lists and annotates the entire collection of human variants (with the exception of approximately 1.6 million variants). === Biomedical imaging === * DCM β [[Digital Imaging and Communications in Medicine]] (DICOM) * NIfTI β [[Neuroimaging Informatics Technology Initiative]] * NII β single-file (combined data and meta-data) style * NII.GZ β [[gzip]]-compressed, used transparently by some software, notably the [[FMRIB Software Library]] (FSL) * GII β single-file (combined data and meta-data) style; NIfTI offspring for brain surface data * IMG, HDR β dual-file (separate data and meta-data, respectively) style * IMG, HDR β [[Analyze (imaging software)|Analyze]] data, meta-data * BRIK, HEAD β [[Afni|AFNI]] data, meta-data * MGH β uncompressed, [[Massachusetts General Hospital]] imaging format, used by the [[FreeSurfer]] brain analysis package * MGZ β [[ZIP (file format)|zip]]-compressed, [[Massachusetts General Hospital]] imaging format, used by the [[FreeSurfer]] brain analysis package * MINC β [[Medical Imaging NetCDF]] format * MNC β previously based on NetCDF; since version 2.0, based on HDF5 === Biomedical signals (time series) === * ACQ β AcqKnowledge format for Windows/PC from Biopac Systems Inc., Goleta, CA, USA * ADICHT β LabChart format from ADInstruments Pty Ltd, Bella Vista NSW, Australia * [[BCI2000]] β The [[BCI2000]] project, Albany, NY, USA * BDF β BioSemi data format from BioSemi B.V. Amsterdam, Netherlands * BKR β The EEG data format developed at the University of Technology Graz, Austria * CFWB β Chart Data Format from ADInstruments Pty Ltd, Bella Vista NSW, Australia * [[DICOM]] β Waveform An extension of Dicom for storing waveform data * ecgML β A markup language for electrocardiogram data acquisition and analysis * EDF, EDF+ β [[European Data Format]] * FEF β File Exchange Format for Vital signs, CEN TS 14271 * [https://arxiv.org/abs/cs/0608052 GDF] β The General Data Format for biomedical signals * HL7aECG β Health Level 7 v3 annotated ECG * MFER β Medical waveform Format Encoding Rules * OpenXDF β Open Exchange Data Format from Neurotronics, Inc., Gainesville, FL, USA * [[SCP-ECG]] β Standard Communication Protocol for Computer assisted electrocardiography EN1064:2007 * SIGIF β A digital SIGnal Interchange Format with application in neurophysiology * WFDB β Format of Physiobank * [https://code.google.com/p/xdf/ XDF] β eXtensible Data Format === Other biomedical formats === * HL7 β [[Health Level 7]], a framework for exchange, integration, sharing, and retrieval of health information electronically * [[xDT]] β a family of data exchange formats for medical records === Biometric formats === * [[CBF (file format)|CBF]] β Common Biometric Format, based on CBEFF 2.0 (Common Biometric ExFramework). * [[EBF (file format)|EBF]] β Extended Biometric Format, based on CBF but with [[S/MIME]] encryption support and semantic extensions * [[CBFX (file format)|CBFX]] β XML Common Biometric Format, based upon XCBF 1.1 (OASIS XML Common Biometric Format) * [[EBFX]] β XML Extended Biometric Format, based on CBFX but with W3C XML Encryption support and semantic extensions
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)