Monarch geneset OGS2.0

DPOGS200942
TranscriptDPOGS200942-TA1461 bp
ProteinDPOGS200942-PA486 aa
Genomic positionDPSCF300215 - 177544-196742
RNAseq coverage52x (Rank: top 70%)
Annotation
HeliconiusHMEL0074957e-10377.78% 
BombyxBGIBMGA010206-TA0.076.30% 
Drosophiladys-PD9e-7661.58% 
EBI UniRef50UniRef50_E0W4046e-9549.13%Hypoxia-inducible factor 1 alpha, putative n=1 Tax=Pediculus humanus corporis RepID=E0W404_PEDHC
NCBI RefSeqXP_002433098.11e-9549.13%hypoxia-inducible factor 1 alpha, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420253712e-9449.13%hypoxia-inducible factor 1 alpha, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420253711e-10647.33%hypoxia-inducible factor 1 alpha, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00055153e-05protein binding
KEGG pathwayxtr:4481175e-07 
 K09095 (HIF2A, EPAS1)maps-> Pathways in cancer
    Renal cell carcinoma
Orthology groupMCL25715 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200942-TA
ATGTCAAAGCTGCCCGCCCATGAGTTGAAGCCAGACGTGGTGTTGGTACGTGGTCATTTCGTGTCCTATCTTCCACTGTGCAGTCGCAACGAGCCAGTCTTCCTTGCTGCCTGCACACCCCTCGCCATGCCGGAAACCAGAGAATGCGTTGTGCACGGCGCCACTAACGTCTTCACTACAGTACACGCGATGGACATGAAAATACTGCATATAGATACCAATAGCGAGTGGCACCTCGGCTGGGATAAAAACTCGCTCCACGGAGTCAGCTGGTACCACCTCCTACATCCAGATTGCTGTAAAGAAGCGCAGAACAAACATAGACTTATAACCCAATCAGAGCAAGAGCGATCGTGCATCGCGTTATTGCGCCTCCAGCGGCGATGTGGCCAATTCCTCTGGGTTCATGCGGTTCTCCAGCTCAAAGACAACCTGGAGAACTCTCAGAGGCCCGTCATCGTGTGCACCAACCAGGTTTTGAGTGAACAAGAAGCAGCTGTGATGAAAGCAAATCCGTGGCTTTATCACTATTATGCGGTTCAATCTAAACTTCATTATGGTTTGGCTTATGAAGCGGCCACCAGGATGTATGGACCTCCCCCGCCAGATATCCAGGTCTACCCTCCAACAATGCAATATACACCGACTCCCGCGGTTTGTCTTAATGGAAACCAAGCTATGATGATGCCAAATGCTGGTCTTCAGAATCACATCCCTCAAGTTCCTCCACACTTAACACCTATTCATTACGCTTACGAAAGACTTGATAATGGGCCAGTAGACTATTCAGTTTCGCTTCAAGAAAACAGGGGATATCATAATATACGGATAGAAGATATCAGGTCTCCTGAAACGAAAAGAAAAGTGAAACAGGAGGAAGATAAAACTAGCGTACCTAATATATCGCCTAGATTTTCATCCGCTGCTGGGGACACTTTAGTCGCCATAGCAGCAACAACGGTTGCTACACGTAGGCCAGGCTCTAAAAATTTTAATATCACTGAAACGGAGATGATAGATCAATGGAATCCTAGTCCCACATGGTCGGAATCAGCTCTGCAGAAAGTTCCAGATGTTTGTCACCAGGAGTTAAGTCCATATGCAAACACAACTCCTCCGACACCTTCAGGAACACCATCAGCCTACACTCACAACTATAGCAATGCATTCGTGTTCGACTGGTCGCCAGAGCAATACGTTCCTTATACTAACAATAGATGTAGTACCGTAACAACAGAAAGTTCCTATCAACAAAATTGGAATAGAAATCATAGAACGACATTTGTTAGTGCAAATGATAGTTCCGCCGACGAAAGTTGTACTTCTAACAGGTTAAACCTTCCGGCGAGGGTTAGTCGAACGCCGCCAGACCATAACTCGTCAAGAGAAGACGAAAGCCTTAAGAGGCAAATGGAACATCTTAATCCTGACTCTCCGTGTGCTAAAAAACCAGCGTTATAG

Protein sequence:

>DPOGS200942-PA
MSKLPAHELKPDVVLVRGHFVSYLPLCSRNEPVFLAACTPLAMPETRECVVHGATNVFTTVHAMDMKILHIDTNSEWHLGWDKNSLHGVSWYHLLHPDCCKEAQNKHRLITQSEQERSCIALLRLQRRCGQFLWVHAVLQLKDNLENSQRPVIVCTNQVLSEQEAAVMKANPWLYHYYAVQSKLHYGLAYEAATRMYGPPPPDIQVYPPTMQYTPTPAVCLNGNQAMMMPNAGLQNHIPQVPPHLTPIHYAYERLDNGPVDYSVSLQENRGYHNIRIEDIRSPETKRKVKQEEDKTSVPNISPRFSSAAGDTLVAIAATTVATRRPGSKNFNITETEMIDQWNPSPTWSESALQKVPDVCHQELSPYANTTPPTPSGTPSAYTHNYSNAFVFDWSPEQYVPYTNNRCSTVTTESSYQQNWNRNHRTTFVSANDSSADESCTSNRLNLPARVSRTPPDHNSSREDESLKRQMEHLNPDSPCAKKPAL-