Data Preparation

Download the templates here

and set up your data

Set up your data or let us do it for you.

If you know your data, it should be an easy task to do for yourself. Simply put your data into our template as instructed, and it should be ready to upload.

Outsource data preparation

If you would prefer to let us prepare your data, we are happy to do so. Simply, check the box “Purchase data preparation” and upload your datasheet.

4 steps to prepare data

1. Download three templates
2. Get an overview of your data
3. Insert your data into the templates
4. Save and upload your data

1. Download all three templates

2. Get an overview of your data

There are four factors we need to have in mind when setting up the three templates.

If you have unanswered questions after the step-through guide, please find the Frequently Asked Questions section at the bottom of this page.

Sample ID

A unique identifier for each sample.

Gene or Protein ID

Reference to the gene/protein, which expression has been measured.
We prefer Entrez gene identifiers or Uniprot identifier, examples: ENSMUSG00000090061, Q6P5U7, respectively. Yet gene names can also be used, like NWD2.

Sample Groups

Identify the sample groups you want to compare, like disease state or treatment state. Remember, we do not charge you per sample group, so feel free to include groups like gender. We want to ensure you get the most information from your data.

Replication

If some samples are technical or biological replicates.

3. Insert your data into the templates

To begin with open all three .csv-files on your computer. The templates are named as follows:

  • Expression - Biogenity.csv
  • Sample groups - Biogenity.csv
  • Replication - Biogenity.csv

  • Get onwards by examining the following steps.

    Compare Solutions
    Sample group template used in this example.

    Step 3.1


    Add ID groups for Expression and Sample Groups


    Expression data
    First, insert the gene or protein ID groups in the desired order in row 1.


    Sample Groups
    Insert your sample groups into the desired order in row 1.


    Replication
    Skip to the next step.


    If needed, just keep on adding new rows for when reaching the end of our template.

    Step 3.2


    Add Sample IDs


    Expression data
    Secondly, add all the Sample IDs from row A2 and below. Use the names of your sample IDs.


    Sample Groups
    Add all the sample IDs from row A2 and below. Use the names of your sample IDs.


    Replication
    Add all the Sample IDs from row A2 and down. Use the name of your sample IDs.


    Compare Solutions
    Sample group template used in this example.
    Compare Solutions
    Sample group template used in this example.

    Step 3.3


    Insert datapoints


    Expression data
    Thirdly, copy and insert all the quantifications into the gene/protein ID according to the Sample ID.


    Sample Groups
    Copy and insert all your specifications into the sample group according to the Sample ID.


    Replication
    Copy and insert all the specifications into the replicator-group according to the Sample ID.


    Lastly, make sure that each sample group matches every specification across the row correctly.

    4. Save and upload your data


    Save it as a .csv

    Choose product

    Upload each template

    And you are done.

    Your order will soon be processed.

    Frequently Asked Questions


    If you want to see an example of how a template would look when filled out, we have gathered a downloadable example for each template here:

  • Download example of the expression template
  • Download example of the sample group template
  • Download example of the replication template
  • The answer depends on the size of your data set. You can use commonly known software for small data sets of up to approx. 65,000 rows or 256 columns, such as Microsoft Office, Numbers (Mac), Open Office, etc.

    For large data sets that exceed 65.000 rows or 256 columns, we recommend using software capable of handling it, such as pyspread and Delimit.

    Types of data

    Currently, we mostly analyze gene and protein expressions but feel free to ask if you have other data to be processed. If it matches our technology, we can deliver.

    Gene Expressions: with classical gene identifiers, such as ENTREZID and gene name.

    Protein Expressions: with classical gene identifiers, such as UniProtKB AC/ID, ENTREZID and gene name.


    Formats

    Currently, we support the following formats:

    Prepare data yourself:

    .csv


    Let us handle the data preparation:

    .csv .txt .xlsx .xlsm .xls .xml


    Didn't you find the right answer?

    Don't worry. Feel free to contact us:

    Email us




    info@biogenity.com

    Call Customer Service




    (+45) 71116050

    Mon - Fri: 9 a.m. - 5 p.m. (GMT+1)

    Subscribe to our Newsletter


    Get updates about the development of new data analytic pipelines and new big data-based approached to discover and validate biomarkers, special offers and when we seek partners for new projects.

    How often do I get a Newsletter?

    We know how annoying overuse of newsletters can be - which is why we only send five a year.


    By subscribing you agree to our privacy policy.