TP03_0202 cDNA ORF clone, Theileria parva strain Muguga

The following TP03_0202 gene cDNA ORF clone sequences were retrieved from the NCBI Reference Sequence Database (RefSeq). These sequences represent the protein coding region of the TP03_0202 cDNA ORF which is encoded by the open reading frame (ORF) sequence. ORF sequences can be delivered in our standard vector, pcDNA3.1+/C-(K)DYK or the vector of your choice as an expression/transfection-ready ORF clone. Not the clone you want? Click here to find your clone.

***CloneID Accession No. Definition **Vector *Turnaround time Price (USD) Select
OTl00121 XM_758127.1
Latest version!
Theileria parva strain Muguga chromosome 3 DNA polymerase alpha (TP03_0202) partial mRNA. pcDNA3.1-C-(k)DYK or customized vector 19-21 $727.30
$1039.00

ORF Online Only Promotion

Next-day Shipping ORF Clones ( in default vector with tag)
1 Clone 30% OFF
2-4 Clone 40% OFF
5 or more Clone 50% OFF
All Other ORF Clones
30% OFF

*Business Day

** You may select a custom vector to replace pcDNA3.1+/C-(K)DYK after clone is added to cart.

** GenScript guarantees 100% sequence accuracy of all synthetic DNA constructs we deliver, but we do not guarantee protein expression in your experimental system. Protein expression is influenced by many factors that may vary between experiments or laboratories. In addition, please pay attention to the signal peptide, propeptide and transit peptide in target ORF, which may affect the choice of vector (N/C terminal tag vector).

***One clone ID might be correlated to multiple accession numbers, which share the same CDS sequence.

  • Reference Sequences (Refseq)
    CloneID OTl00121
    Clone ID Related Accession (Same CDS sequence) XM_758127.1
    Accession Version XM_758127.1 Latest version! Documents for ORF clone product in default vector
    Sequence Information ORF Nucleotide Sequence (Length: 4356bp)
    Protein sequence
    SNP
    Vector pcDNA3.1-C-(k)DYK or customized vector User Manual
    Clone information Clone Map MSDS
    Tag on pcDNA3.1+/C-(K)DYK C terminal DYKDDDDK tags
    ORF Insert Method CloneEZ™ Seamless cloning technology
    Insert Structure linear
    Update Date 1223913600000
    Organism Theileria parva strain Muguga
    Product DNA polymerase alpha
    Comment Comment: PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. This record is derived from an annotated genomic sequence (NW_876245). COMPLETENESS: incomplete on the 5' end.

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    ATGGTAAAAA ACAAGAAAAA TGTGATCGAA TCAGCCCTTT CGAAATTAAG AAACCAGAGA 
    GAAGGGAAGG CAAACCTAAT CGATGAATAT CAAGTGGAAG ATGAGAACGA TCAGATCTAC
    GAGATAATAA CTGAGGAGGA ATTTAATGAG AGGAACAGGA AAAGAAAACT TGAAGATTTT
    ATAGAAGGGC CTGATTACAG TGACGATGAG GAGGATTTGG AGTTGTTAGA GGAACAAGTA
    GAGCGATTGG GGCGGAAGGC TTCTAAAGGT GTTCCTCATG GGAAATCTAT CCAACAACAC
    TTTATTGAGA TGGCTGCTCA GGACAGTCTT TCGTACCACC AACCAAAACC TACAGAAACA
    GACAAGGCTC TGTTGGAACG ATTGACAAAG TTTGAGGAGG AACTTGACAA TGATACTAAT
    GATTTAGGGC CAACTGTGTT ACAGAATCAG TTTGTAACTC CACAAATGTA TACACAACCA
    AGTTACCTAT CACAAGCACT AATGCAAGCA TCAGCTGAAG CACATTTACC GGAAAATTTA
    TTCAAAGCTT ATCAACCAGG AATCGCACTC GTTCATCAAT CTCAAGAAAT GGTTGAGGAA
    CCTAAGATAG ATTCGGCCAT ATTGGAAAGT TTTAGAGAAG TTGAAATGGA TGAAGATGTA
    GTTATCGATT CAGGACAATC TCCACCTCAA TATACTCAAC CTGATGCATT GACGTTGAGT
    GAGGATTTGG CGTTTTATTT ATTGGATTTA TGTGAAGATG GAGGATATTT ACAGTTATTT
    GGTCGAATTC GTACTAGTGG AACTGATACC GAAAGTTGTA TGGTAACTGT TAAGAACATG
    TATCGTTCGT TGTTCTTTAA GCCAAGGATG GACTTGCTGT TTAATGAGTT GGGAGAGGTT
    GTTGAGACAG CCTCTGATGG AGTAATTAAA AGCAGTGACC CACTCTATGA ACGTCACTTA
    ATGATGAACT TTTTCAACGA GTTTGAAGTC ATAAGAAAAG ACTATGGGAT CAAGAAAATT
    AAATATAAAC TTGTCAAGAG ATCACTACTT ACCTATGGCC CAACTAAGCC AGAATTTTAC
    ATTAAAGTTT GTTACCCGTT CCAATTTCAA ATTTTAAACA AACAACATCT AAGCGGAAAG
    TTTTATGAGG ATGTTTATCA CAACCATTCT ACACAAGTGG AATTATTTCT GTTGAAGAGG
    AGGATAAAGG GCCCAACGTG GTTGAGACTA ACTGATTACC AGAAATCAAC TGATAGTTTA
    TCATATTGTA AGGTTGAGAT TGAGATTGAC TCACACAAAA ATGTGCAACT CTGGCACTCT
    AAAAACAATG AAGAGCTTTT CACACCCAAA CTTAACATTC TTTCAATCTC AATCAAAACG
    CTTTTCACCA CACCAACTCA TCAAGAGGTG TTTATAATAA GTTGTGTGTA TAATAAGTAT
    AACATTGATG AGACTGCGAT GAAGGATAAT TATCAGTTTA TTGGTATTAG GAAGCCTCAG
    AATCTCCAAT GGCCAAATGA ACTCAAACAT TTCCTTGAAA AAAGGAACTA TTTCCGTATT
    TTCGAACAAG AAAGAGGACT ATTGGCTTAT TTCATGAACT ACCTCAGGGG AATCGATCCC
    GACGTGGTGA TTGGACATGA CATTCATCAG AACTGTGTTG ATCTACTGGT GAAACGTTGT
    AATTTACTCA ATATTCCATT GAGATTATCA CTTTCAAGGC TCAAACTGGT CAAGAAATCA
    CATTCACCAA TACTTTTTGG GAGATTATTT TGTGATACAA GATTGCTAAC CAAAGAATTA
    AATCCCTCTA AGGAAAATTA CAATTTGATA ACGTGTGTGA ATGACATATT GGAGATAAAG
    AATGTTGGAG ATCTGAATTT TTATAGCAGG TCAGCATTTA GCATGACTGA AATCCAAAAC
    CTTTTTGGTA AACCAGACTC GATTAAGCAC TTGCTTAAAC TACATTACAG TTTTCAGATA
    TTACCATTGA CGAAAGAATT GACAATAATC GCTGGAAACA CTTGGTCAAG AAGTATACAA
    TGTGCCAGGT CTGAGAGGAT TGATTTTTTG TTAATGCACG AGTTCTATCG CAATAAGTAC
    ATTCTCGATA ACGTCTTCAA CAAATTTAAA GAGCAAAGTA TTACCACACT AGGCTTTTAC
    TTCTATATAC ATTTTACTCA TCTGGATGAT ACGAAGAGTT ATGAGGGAGG GTTGGTGTTT
    GAGCCAATAT CGGGTTTGTA CGATAACTTC ATCCTGTTGC TTGATTTCAA CTCCTTGTAC
    CCGTCGATAA TCCAGGAATA TAACATTTGC TTTACGACAA CAGTTGCAAA TGGCGAGGAA
    AGCGTCGTGA TCCTGGACAA CGTTGGAGTC CTCCCATCGA TTCTTAAACG TCTAGTTGAA
    CTACGTCTTA ATATCAAGAA TATCATTAAA GGCGAGAAGA ACGAGACCAG AAGAGTCCAG
    CTTTCAACAA GACAACTGGC GTTGAAACTT ATCGCAAATT CTATTTACGG CTGCTTGGGA
    AGTAATTTTT CAAGATTTCA TTGCAAATAT ATCGCTTCTT ACATTACTAA ACTAGGTACT
    GCGTCGCGTG CAGGCAATTA CCCTATTCCT CATTTCACCA CTTATTTACA CAGTCGTGAG
    TTATTGAGGA GCACTAAGGA GAAGGTGGAG AATGTGTTTA ATTTACAAGT AATTTATGGT
    GACACAGATT CTTTGATGAT AAACACGAAC ATCCGGGATG ATGGGAATCT AACAAATTAC
    AATGCAGCAA ACCAACTTGC TAATAACCTT GTCACTTTCA TTAACAAATC ACACAAAAAG
    TTGGAAATCG GCATAGACGC TGTTTTCACA CGCTTGCTGC TACTCAAAAA GAAGAAATAC
    GCATCACTCA AAGTGGTCGA CTATGGGAGT GGACAATTTG AGAGAGAGAT AAAGGGGTTG
    GATTTTATTC GTCGTGACTG GTCACTTCTA ACAAAGGAAA TTGGAAACAA ATTGCTTAAC
    ATTATTTTAA ACTCAAATTA CTACGACGGA GTCGACGGAA TTGTACAAGA AATACACTCA
    ACGCTTATTA ATCTCAACGA ACAACTCAAT AATCAATCAA TTGAATTGAG TAAGTTTTTG
    ATAACGAAGC AGCTGACGAA GAACCCAAAA GAGTATAGCG ATGTGCAAAA CTTACCACAC
    GTCTCTGTGG CTCTAAGACT AAATGAAAAA GGGCTTGGAA ATTATTCAAC AGGACATGAA
    ATTTCATATA TCATATGCAC CAAATCGTCA GCAACTAAGT TCCACACGAA TACCACTAGT
    GACAAGGATA ATAGTGCTGA AAATAATGTT AATAGTAGTG GTAATATTGG TGGTAGTTTG
    AGTTTCAGAG CGTTTAGCTA TAATGAAGTA ATGGAGAATG GGTTGGAGGT TGACATAAGT
    TACTACAAAC AACAGCAGCT GTTACCTCCA ATATTACGTT TGTGTAGTAT AATTGAGGGT
    ACAGATATAC AACGGCTTTC CAGGTGTTTA CAAATCGAAA AGAGCATAGC CGTTACACAG
    GAGTACAATT ATGAACAAGA ATCTAAAGTT TTATCTCTAA TCAAAAGATC ACACGAGAAC
    TACAGAGACG TGGAGATAAA TTCTCAATTA TCATGTCAAC ATTGTAACGG CCCAGTTCTA
    CCCAGTTTCT TCCTCAAATA TTTTAAGTGT AATCATTGTC TGAGGTGGTT ACCGTTACAT
    TTATTGAGGA ATTGGGTTGA CCGTTTGTTA TACGAATTAA CAGTGCAATC ATCTTTCTGT
    ATTAGAGTCT GTAACATCTG TAATGTTACC ACACTCAACG TAACTCTAGG AGATGTTGAC
    AGATGCCCAC AACCAACGTG TCAATCTAAC GACTCTATGC AAACCATTTT CACATCAAAC
    AAAGTCTACA TGTATTACGA TTATTTGGTG TATATGCTGG AGGGCAAATT AAATAATCCT
    CTAAAAGATA CCGAGACGAA TAATACTACA CAGACCAGTG CTGCTGCTAA TAATACAGAG
    GAAAATGAGG AAAATTTGGT GAATGTGATG ATTGATCTTG ATGGGAAGTT GACAATATTG
    TATGATGAGC CATTCGGAGA AGTACGAACA TTTGATGAAG TTATTGATGA AGTGACGAGA
    GGAAGCGCAG TGAGAGCAAG CTCAGCCCTA AGGCTGTGCG CGCAGCACAT CATAGGACTA
    ATGGAGGCAA TCCCGTATCT GCGCTACTAC ACATTGGACT ACCAGAAGGA GAGAGAAATT
    CTCTGTAATC GAGTTAAAAC ATTACAGCTT AAAAATTCAT ATAGCGTTGT GGATTTATCA
    CAACTCTTCC ATCTACTCTC CCCATTCAGT AATTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

    RefSeq XP_763220.1
    CDS1..4356
    Translation

    Target ORF information:

    RefSeq Version XM_758127.1
    Organism Theileria parva strain Muguga
    Definition Theileria parva strain Muguga chromosome 3 DNA polymerase alpha (TP03_0202) partial mRNA.

    Target ORF information:

    Epitope DYKDDDDK
    Bacterial selection AMPR
    Mammalian selection NeoR
    Vector pcDNA3.1+/C-(K)DYK
    XM_758127.1

    ORF Insert Sequence:

    1
    61
    121
    181
    241
    301
    361
    421
    481
    541
    601
    661
    721
    781
    841
    901
    961
    1021
    1081
    1141
    1201
    1261
    1321
    1381
    1441
    1501
    1561
    1621
    1681
    1741
    1801
    1861
    1921
    1981
    2041
    2101
    2161
    2221
    2281
    2341
    2401
    2461
    2521
    2581
    2641
    2701
    2761
    2821
    2881
    2941
    3001
    3061
    3121
    3181
    3241
    3301
    3361
    3421
    3481
    3541
    3601
    3661
    3721
    3781
    3841
    3901
    3961
    4021
    4081
    4141
    4201
    4261
    4321
    ATGGTAAAAA ACAAGAAAAA TGTGATCGAA TCAGCCCTTT CGAAATTAAG AAACCAGAGA 
    GAAGGGAAGG CAAACCTAAT CGATGAATAT CAAGTGGAAG ATGAGAACGA TCAGATCTAC
    GAGATAATAA CTGAGGAGGA ATTTAATGAG AGGAACAGGA AAAGAAAACT TGAAGATTTT
    ATAGAAGGGC CTGATTACAG TGACGATGAG GAGGATTTGG AGTTGTTAGA GGAACAAGTA
    GAGCGATTGG GGCGGAAGGC TTCTAAAGGT GTTCCTCATG GGAAATCTAT CCAACAACAC
    TTTATTGAGA TGGCTGCTCA GGACAGTCTT TCGTACCACC AACCAAAACC TACAGAAACA
    GACAAGGCTC TGTTGGAACG ATTGACAAAG TTTGAGGAGG AACTTGACAA TGATACTAAT
    GATTTAGGGC CAACTGTGTT ACAGAATCAG TTTGTAACTC CACAAATGTA TACACAACCA
    AGTTACCTAT CACAAGCACT AATGCAAGCA TCAGCTGAAG CACATTTACC GGAAAATTTA
    TTCAAAGCTT ATCAACCAGG AATCGCACTC GTTCATCAAT CTCAAGAAAT GGTTGAGGAA
    CCTAAGATAG ATTCGGCCAT ATTGGAAAGT TTTAGAGAAG TTGAAATGGA TGAAGATGTA
    GTTATCGATT CAGGACAATC TCCACCTCAA TATACTCAAC CTGATGCATT GACGTTGAGT
    GAGGATTTGG CGTTTTATTT ATTGGATTTA TGTGAAGATG GAGGATATTT ACAGTTATTT
    GGTCGAATTC GTACTAGTGG AACTGATACC GAAAGTTGTA TGGTAACTGT TAAGAACATG
    TATCGTTCGT TGTTCTTTAA GCCAAGGATG GACTTGCTGT TTAATGAGTT GGGAGAGGTT
    GTTGAGACAG CCTCTGATGG AGTAATTAAA AGCAGTGACC CACTCTATGA ACGTCACTTA
    ATGATGAACT TTTTCAACGA GTTTGAAGTC ATAAGAAAAG ACTATGGGAT CAAGAAAATT
    AAATATAAAC TTGTCAAGAG ATCACTACTT ACCTATGGCC CAACTAAGCC AGAATTTTAC
    ATTAAAGTTT GTTACCCGTT CCAATTTCAA ATTTTAAACA AACAACATCT AAGCGGAAAG
    TTTTATGAGG ATGTTTATCA CAACCATTCT ACACAAGTGG AATTATTTCT GTTGAAGAGG
    AGGATAAAGG GCCCAACGTG GTTGAGACTA ACTGATTACC AGAAATCAAC TGATAGTTTA
    TCATATTGTA AGGTTGAGAT TGAGATTGAC TCACACAAAA ATGTGCAACT CTGGCACTCT
    AAAAACAATG AAGAGCTTTT CACACCCAAA CTTAACATTC TTTCAATCTC AATCAAAACG
    CTTTTCACCA CACCAACTCA TCAAGAGGTG TTTATAATAA GTTGTGTGTA TAATAAGTAT
    AACATTGATG AGACTGCGAT GAAGGATAAT TATCAGTTTA TTGGTATTAG GAAGCCTCAG
    AATCTCCAAT GGCCAAATGA ACTCAAACAT TTCCTTGAAA AAAGGAACTA TTTCCGTATT
    TTCGAACAAG AAAGAGGACT ATTGGCTTAT TTCATGAACT ACCTCAGGGG AATCGATCCC
    GACGTGGTGA TTGGACATGA CATTCATCAG AACTGTGTTG ATCTACTGGT GAAACGTTGT
    AATTTACTCA ATATTCCATT GAGATTATCA CTTTCAAGGC TCAAACTGGT CAAGAAATCA
    CATTCACCAA TACTTTTTGG GAGATTATTT TGTGATACAA GATTGCTAAC CAAAGAATTA
    AATCCCTCTA AGGAAAATTA CAATTTGATA ACGTGTGTGA ATGACATATT GGAGATAAAG
    AATGTTGGAG ATCTGAATTT TTATAGCAGG TCAGCATTTA GCATGACTGA AATCCAAAAC
    CTTTTTGGTA AACCAGACTC GATTAAGCAC TTGCTTAAAC TACATTACAG TTTTCAGATA
    TTACCATTGA CGAAAGAATT GACAATAATC GCTGGAAACA CTTGGTCAAG AAGTATACAA
    TGTGCCAGGT CTGAGAGGAT TGATTTTTTG TTAATGCACG AGTTCTATCG CAATAAGTAC
    ATTCTCGATA ACGTCTTCAA CAAATTTAAA GAGCAAAGTA TTACCACACT AGGCTTTTAC
    TTCTATATAC ATTTTACTCA TCTGGATGAT ACGAAGAGTT ATGAGGGAGG GTTGGTGTTT
    GAGCCAATAT CGGGTTTGTA CGATAACTTC ATCCTGTTGC TTGATTTCAA CTCCTTGTAC
    CCGTCGATAA TCCAGGAATA TAACATTTGC TTTACGACAA CAGTTGCAAA TGGCGAGGAA
    AGCGTCGTGA TCCTGGACAA CGTTGGAGTC CTCCCATCGA TTCTTAAACG TCTAGTTGAA
    CTACGTCTTA ATATCAAGAA TATCATTAAA GGCGAGAAGA ACGAGACCAG AAGAGTCCAG
    CTTTCAACAA GACAACTGGC GTTGAAACTT ATCGCAAATT CTATTTACGG CTGCTTGGGA
    AGTAATTTTT CAAGATTTCA TTGCAAATAT ATCGCTTCTT ACATTACTAA ACTAGGTACT
    GCGTCGCGTG CAGGCAATTA CCCTATTCCT CATTTCACCA CTTATTTACA CAGTCGTGAG
    TTATTGAGGA GCACTAAGGA GAAGGTGGAG AATGTGTTTA ATTTACAAGT AATTTATGGT
    GACACAGATT CTTTGATGAT AAACACGAAC ATCCGGGATG ATGGGAATCT AACAAATTAC
    AATGCAGCAA ACCAACTTGC TAATAACCTT GTCACTTTCA TTAACAAATC ACACAAAAAG
    TTGGAAATCG GCATAGACGC TGTTTTCACA CGCTTGCTGC TACTCAAAAA GAAGAAATAC
    GCATCACTCA AAGTGGTCGA CTATGGGAGT GGACAATTTG AGAGAGAGAT AAAGGGGTTG
    GATTTTATTC GTCGTGACTG GTCACTTCTA ACAAAGGAAA TTGGAAACAA ATTGCTTAAC
    ATTATTTTAA ACTCAAATTA CTACGACGGA GTCGACGGAA TTGTACAAGA AATACACTCA
    ACGCTTATTA ATCTCAACGA ACAACTCAAT AATCAATCAA TTGAATTGAG TAAGTTTTTG
    ATAACGAAGC AGCTGACGAA GAACCCAAAA GAGTATAGCG ATGTGCAAAA CTTACCACAC
    GTCTCTGTGG CTCTAAGACT AAATGAAAAA GGGCTTGGAA ATTATTCAAC AGGACATGAA
    ATTTCATATA TCATATGCAC CAAATCGTCA GCAACTAAGT TCCACACGAA TACCACTAGT
    GACAAGGATA ATAGTGCTGA AAATAATGTT AATAGTAGTG GTAATATTGG TGGTAGTTTG
    AGTTTCAGAG CGTTTAGCTA TAATGAAGTA ATGGAGAATG GGTTGGAGGT TGACATAAGT
    TACTACAAAC AACAGCAGCT GTTACCTCCA ATATTACGTT TGTGTAGTAT AATTGAGGGT
    ACAGATATAC AACGGCTTTC CAGGTGTTTA CAAATCGAAA AGAGCATAGC CGTTACACAG
    GAGTACAATT ATGAACAAGA ATCTAAAGTT TTATCTCTAA TCAAAAGATC ACACGAGAAC
    TACAGAGACG TGGAGATAAA TTCTCAATTA TCATGTCAAC ATTGTAACGG CCCAGTTCTA
    CCCAGTTTCT TCCTCAAATA TTTTAAGTGT AATCATTGTC TGAGGTGGTT ACCGTTACAT
    TTATTGAGGA ATTGGGTTGA CCGTTTGTTA TACGAATTAA CAGTGCAATC ATCTTTCTGT
    ATTAGAGTCT GTAACATCTG TAATGTTACC ACACTCAACG TAACTCTAGG AGATGTTGAC
    AGATGCCCAC AACCAACGTG TCAATCTAAC GACTCTATGC AAACCATTTT CACATCAAAC
    AAAGTCTACA TGTATTACGA TTATTTGGTG TATATGCTGG AGGGCAAATT AAATAATCCT
    CTAAAAGATA CCGAGACGAA TAATACTACA CAGACCAGTG CTGCTGCTAA TAATACAGAG
    GAAAATGAGG AAAATTTGGT GAATGTGATG ATTGATCTTG ATGGGAAGTT GACAATATTG
    TATGATGAGC CATTCGGAGA AGTACGAACA TTTGATGAAG TTATTGATGA AGTGACGAGA
    GGAAGCGCAG TGAGAGCAAG CTCAGCCCTA AGGCTGTGCG CGCAGCACAT CATAGGACTA
    ATGGAGGCAA TCCCGTATCT GCGCTACTAC ACATTGGACT ACCAGAAGGA GAGAGAAATT
    CTCTGTAATC GAGTTAAAAC ATTACAGCTT AAAAATTCAT ATAGCGTTGT GGATTTATCA
    CAACTCTTCC ATCTACTCTC CCCATTCAGT AATTAA

    The stop codons will be deleted if pcDNA3.1+/C-(K)DYK vector is selected.

  • PubMed

    Genome sequence of Theileria parva, a bovine pathogen that transforms lymphocytes.
    Science (New York, N.Y.)309(5731)134-7(2005 Jul)
    Gardner MJ,Bishop R,Shah T,de Villiers EP,Carlton JM,Hall N,Ren Q,Paulsen IT,Pain A,Berriman M,Wilson RJ,Sato S,Ralph SA,Mann DJ,Xiong Z,Shallom SJ,Weidman J,Jiang L,Lynn J,Weaver B,Shoaibi A,Domingo AR,Wasawo D,Crabtree J,Wortman JR,Haas B,Angiuoli SV,Creasy TH,Lu C,Suh B,Silva JC,Utterback TR,Feldblyum TV,Pertea M,Allen J,Nierman WC,Taracha EL,Salzberg SL,White OR,Fitzhugh HA,Morzaria S,Venter JC,Fraser CM,Nene V