Monarch geneset OGS2.0

DPOGS200963
TranscriptDPOGS200963-TA1614 bp
ProteinDPOGS200963-PA537 aa
Genomic positionDPSCF300215 + 287887-296411
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0039690.068.11% 
BombyxBGIBMGA010167-TA0.074.29% 
Drosophiladys-PD2e-14664.41% 
EBI UniRef50UniRef50_Q297A13e-14664.65%GA16931 n=3 Tax=Neoptera RepID=Q297A1_DROPS
NCBI RefSeqXP_001990638.18e-15067.80%GH18137 [Drosophila grimshawi]
NCBI nr blastpgi|2700149471e-14972.03%hypothetical protein TcasGA2_TC013566 [Tribolium castaneum]
NCBI nr blastxgi|2700149478e-14572.03%hypothetical protein TcasGA2_TC013566 [Tribolium castaneum]
Group
KEGG pathwayxla:4441454e-19 
 K09095 (HIF2A, EPAS1)maps-> Pathways in cancer
    Renal cell carcinoma
Orthology groupMCL13931 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200963-TA
ATGTTTTTAAGATTCGAGCCAACCAAATCGACGAAAGGCGCGAGCAAGATGCGTCGCGATCTCATCAACGCTGAGATATCCAACCTCCGCGATCTGCTACCCCTACCACCGTCCACAAGACAAAGACTGTCACAGCTGCAACTAATGGCGCTGGTCTGTGTGTACGTCAGGAAAATGAATTACTTCCAACAAGTGTTCAAGAGTCACGACTTTAGTTATCAGTACCAAGAGCAACCTACTCCTACACCAAATATCGGATTTTCAAAGGCAATGAATGGTTTTATGATGATGATGACACAGAACGGAAAACTGTTGTATATATCAGAGAATGCTGCGGAATATTTAGGACATTCTATGGAGGATCTTCTGATTCATGGCGATAGCGTTTACGATATCATTGACAGACAAGACCACCAGATGGTTCAACTTGAATTGAACAGAACTAGCAACAGTGAGACTGAAAATGTACCAAAGAATAGACATTTCTTCTGCAGAATGAACGTCTCAAGAAATGCGAGACGCCAAATGAGATTTGGTGACCAAAAAGTAGTTTTAGTTCAAGGGCATTATGTGTCATATTTGCCGCTTTGTAGCCGCAATGAGCCGGTGTTCCTAGCATCATGTACGCCTCTGGCTATGCCTGAAACAAGGGAATGTATAGTTCATGGAGCCACGAATGTGTTCACTACCATACATTCGATGGACATGAAGATATTACACATAGATACCAACGGTGAATGGTACTTAGGTTGGAAGAAAACTGATCTGATAGACGTGTCGTGGTATCAAATACTACATTGGGATAGTTTAAGAGAAGCTCAAACCAAACATAGATTAAGCAATATTACCCAATCGGAGCAAGATAAATGTTGCATCCTCTTAGTAAAGCTACAGCAAAGAAGCGGTTCGTTCCTATGGATACATATGGTGCTGCAAGTCAAAGAGGCTGCGGATGCTCCCAGGCAGTTCATTGTAGCTACTAACCAAGTCTTAAGTGAAGAAGAAGCATCTATAATGGTGTCCAATTCATGGTTATATCAATATTATGGATACCAGAACCAGAATTGTGGGATGCTCGACCCCAGATGTCAAAAGTTCTTTAGAAGAGAACCATATTACCCTGATCCCTATCCAGAATACCAATATGTAGAAAACGAGATCGATTACACTGGTTACCACATAACGCCGTATGTTTGTCAAAAAGCTGACGATTATGGCTGTGAAAGAATATACACTGATAGAGGTCCAGTAGATTATTCCACACATTCTCCACAATCTACAATAAGTGAGGAAAGATCTCCTTTACATTACGAAACGGGTGATGTCGTTGTGAACAGTAACATGTATATGTGCAGTAAGAGAGAATATTATGACCAATACGCGCAAGTTCATTACACACCAGAAGCTTGTGGCAGTGGAAACATTGAAGGAATCGAATACCCGGCCGCTAAAAGAATGAGATTAACAACGCCACTGAGCATAGAGGGCACTGATGGCATGGAGAGATGGAACCCCAGCCCGCCCTGGTCTGATACACTCAAACTGACGGATTACACACAGAGATTTACTTATAATATGCTACCGGCGCCTACGGAAAGAACTATTGTCACTTAA

Protein sequence:

>DPOGS200963-PA
MFLRFEPTKSTKGASKMRRDLINAEISNLRDLLPLPPSTRQRLSQLQLMALVCVYVRKMNYFQQVFKSHDFSYQYQEQPTPTPNIGFSKAMNGFMMMMTQNGKLLYISENAAEYLGHSMEDLLIHGDSVYDIIDRQDHQMVQLELNRTSNSETENVPKNRHFFCRMNVSRNARRQMRFGDQKVVLVQGHYVSYLPLCSRNEPVFLASCTPLAMPETRECIVHGATNVFTTIHSMDMKILHIDTNGEWYLGWKKTDLIDVSWYQILHWDSLREAQTKHRLSNITQSEQDKCCILLVKLQQRSGSFLWIHMVLQVKEAADAPRQFIVATNQVLSEEEASIMVSNSWLYQYYGYQNQNCGMLDPRCQKFFRREPYYPDPYPEYQYVENEIDYTGYHITPYVCQKADDYGCERIYTDRGPVDYSTHSPQSTISEERSPLHYETGDVVVNSNMYMCSKREYYDQYAQVHYTPEACGSGNIEGIEYPAAKRMRLTTPLSIEGTDGMERWNPSPPWSDTLKLTDYTQRFTYNMLPAPTERTIVT-