Files

File formats

Note

All the files can be compressed using GZIP (extension “.gz”), BZIP2 (extension “.bz2”) or LZMA (extension “.xz”)

Input file format

The variants file is a text file with, at least, 5 columns separated by a tab character (the header is required, but the order of the columns can change):

  • Column CHROMOSOME: Chromosome. A number between 1 and 22 or the letter X or Y (upper case)
  • Column POSITION: Mutation position. A positive integer.
  • Column REF: Reference allele. A single letter: A, C, G or T (upper case)
  • Column ALT: Alternate allele. A single letter: A, C, G or T (upper case)
  • Column SAMPLE: Sample identifier. Any alphanumeric string.
  • Column CANCER_TYPE: Cancer type. Any alphanumeric string. Optional.
  • Column SIGNATURE: User defined signature categories. Any alphanumeric string. Optional.

Regions file format

The regions file is a text file with, at least, 4 columns separated by a tab character (the column order must be preserved):

  • Column 1 [CHROMOSOME]: Chromosome. A number between 1 and 22 or the letter X or Y (upper case)
  • Column 2 [START]: Start position. A positive integer.
  • Column 3 [STOP]: End position. A positive integer.
  • Column 4 [STRAND]: Strand: + for positive, - for negative, . for unknown.
  • Column 5 [ELEMENT]: Element identifier.
  • Column 6 [SEGMENT]: Segment identifier. Optional column.
  • Column 7 [SYMBOL]: Symbol, a different identifier for the element that will also be printed in the output file. Optional column.

Output file format

OncodriveFML generates a tabulated file with the results with the extension “.tsv”.

Check the output section to find a detailed description regarding the output.