Difference between revisions of "UK Biobank/Downloading the data"

From Vrieze Wiki
Jump to navigation Jump to search
(Created page with "# The phenotype file was downloaded from UK Biobank by the project PI as instructed in the data accessibility email. # All of the utilities from the UK Biobank [http://biobank...")
 
Line 1: Line 1:
# The phenotype file was downloaded from UK Biobank by the project PI as instructed in the data accessibility email.
+
These procedures were all derived from the [http://biobank.ctsu.ox.ac.uk/showcase/exinfo.cgi?src=accessing_data_guide documentation] at the UK Biobank.
# All of the utilities from the UK Biobank [http://biobank.ctsu.ox.ac.uk/showcase/download.cgi|download] page were retrieved.
+
 
# The key, k1234.key was saved from the PI's email.
+
== Phenotypic data ==
# These commands were run to decrypt the downloaded phenotype file
+
 
<nowiki>
+
<ol>
 +
<li>The phenotype file was downloaded from UK Biobank by the project PI as instructed in the data accessibility email.</li>
 +
<li>All of the utilities from the UK Biobank [http://biobank.ctsu.ox.ac.uk/showcase/download.cgi download] page were retrieved.</li>
 +
<li>The key, k1234.key was saved from the PI's email.</li>
 +
<li> This command was run to decrypt the downloaded phenotype file
 +
<pre>
 
$ ./ukb_unpack ukb1234.enc k1234.key
 
$ ./ukb_unpack ukb1234.enc k1234.key
</nowiki>
+
</pre>
which produced the file ukb1234.enc_ukb
+
which produced the file ukb1234.enc_ukb</li>
 +
<li>Once decrypted, the following commands were run to extract the data into useful formats
 +
<pre>
 +
$ ./ukb_conv ukb1234.enc_ukb bulk -eencoding.ukb
 +
$ ./ukb_conv ukb1234.enc_ukb docs -eencoding.ukb
 +
$ ./ukb_conv ukb1234.enc_ukb r -eencoding.ukb
 +
</pre>
 +
<ol>
 +
<li>bulk is a list of IDs for use with the ukbfetch utility</li>
 +
<li>docs produces an html file containing [https://ibg.colorado.edu/~lessem/ukb6395.html documentation of the variables] in this dataset</li>
 +
<li>r produces a tab deliminated file and an R script for labeling and putting levels on the variables.
 +
</ol>
 +
</li>
 +
</ol>
 +
 
 +
 
 +
== Genotypic data ==
 +
<ol>
 +
 
 +
</ol>

Revision as of 22:49, 19 February 2016

These procedures were all derived from the documentation at the UK Biobank.

Phenotypic data

  1. The phenotype file was downloaded from UK Biobank by the project PI as instructed in the data accessibility email.
  2. All of the utilities from the UK Biobank download page were retrieved.
  3. The key, k1234.key was saved from the PI's email.
  4. This command was run to decrypt the downloaded phenotype file
    $ ./ukb_unpack ukb1234.enc k1234.key
    
    which produced the file ukb1234.enc_ukb
  5. Once decrypted, the following commands were run to extract the data into useful formats
    $ ./ukb_conv ukb1234.enc_ukb bulk -eencoding.ukb
    $ ./ukb_conv ukb1234.enc_ukb docs -eencoding.ukb
    $ ./ukb_conv ukb1234.enc_ukb r -eencoding.ukb
    
    1. bulk is a list of IDs for use with the ukbfetch utility
    2. docs produces an html file containing documentation of the variables in this dataset
    3. r produces a tab deliminated file and an R script for labeling and putting levels on the variables.


Genotypic data