Example 1000 Genomes phase 1 dataset (16-Mar-2012 release); RefSeq coding variants extracted |
This datasetThis dataset represents the "whole-exome" subset of variants from the 1000 Genomes phase 1 (16-March-2012) release. General overview and instructionsThis table browser is designed to provide easy access to any PLINK/Seq project, which consists of one or more VCF files (with variant and genotype data, including extenisble, typed meta-data) and several core reference databases (gene, sequence and variant databases such as RefSeq, CCDS, hg18 human sequence, dbSNP, and the 1000 Genomes Project data). This particular browser is oriented towards exome studies, with a gene-centric focus. The main page contains the following fields: Gene IDThis is the main input field. Enter here either one or more gene symbol (e.g. ANK3), RefSeq NM number or a region (e.g. chr22:17000000..19000000). If there is more than one transcript for a gene, these will be listed, and you can select a specific one. You can click on list to get a list of all gene symbols in the database.The format should be 1 entry per row: either gene name, or NM number, or a range, with 1 or 2 base positions, e.g. chr1:12345..67890 ANK3 chr2:787882 NM_012345i.e. the 3rd row specifies a single position. If no variant exists here, it will be ignored. (In future, an rs-based lookup will be added.) Gene setThis specifies the gene/transcript database used by the browser. By default this is set to a RefSeq table, that is indexed by both NM numbers for transcripts as well as gene symbols. Do not edit this field for now.Variant meta-fieldsThese are the extensible set of fields that were present in the VCF files uploaded into the project. Click on list to see all available fields. If you type the keywords here, then they are appended in the table. For example, type (space delimited) proteinchange type DB(case sensitive) to get info on the nature of the variant, and a flag for dbSNP membership. To append all available variant meta-fields, enter an asterisk (*) in this box. PhenotypeIf phenotype data are available for the individuals, it can be specified here. Currently, only case/control phenotypes are supported. In the gene-table output, a case/control count will be appended to each variant (numbr of non-reference genotypes). In the individual-view, a CASE or CONTROL label will be appended next to each individual/genotype. Mask specificationAny valid mask statement can be specified here. Include filtersAn include filter can be specified here. This is equivalent to entering include="...expression..." in the mask field, but avoids potential issues in parsing the quoted expression, etc. OutputThis should hopefully be fairly self-explanatory. There are 3 main views:
|