Monarch geneset OGS2.0

DPOGS206728
TranscriptDPOGS206728-TA1695 bp
ProteinDPOGS206728-PA564 aa
Genomic positionDPSCF300320 + 71744-75036
RNAseq coverage30x (Rank: top 76%)
Annotation
HeliconiusHMEL0051454e-7743.23% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_F7G2Z31e-2044.23%Uncharacterized protein n=19 Tax=Euteleostomi RepID=F7G2Z3_ORNAN
NCBI RefSeqXP_001181357.16e-2348.54%PREDICTED: similar to AlkB, alkylation repair homolog 2 (E. coli) [Strongylocentrotus purpuratus]
NCBI nr blastpgi|720543911e-2148.54%PREDICTED: similar to AlkB, alkylation repair homolog 2 (E. coli) [Strongylocentrotus purpuratus]
NCBI nr blastxgi|720543916e-2148.54%PREDICTED: similar to AlkB, alkylation repair homolog 2 (E. coli) [Strongylocentrotus purpuratus]
Group
KEGG pathway 
Orthology groupMCL34674 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206728-TA
ATGTACATTGCATTCAAATGGCTGATGGTCATGGCCTTAGCCATGTGGCTGTGTCTGTGTTTTGTGGATATTTATCAAAATAAAATATATGATTTAATAAACATAGAATTCAAAGTCAAAAGTATTTCAGCGCCCATGTTTAAAAACTTAAGTGCAAAACAACAACAGCCATACAAACGTAAAGAAACTCTCAATTTAAGAATTAAAAATGGTGACAACAACCTGTTTCAGTACGTGATGTTCGTGGCTGGTGCATGTATGGTACTAATTTATAACAAAAAAAATCATATCATATTATATTCTACGGAAACCGTGAGGAGAATTAATATAATACTTCTAAAAAAAATCATCGTAATTATAAAGGAGTCGATTCCTTTCTTTACTGGGACTGTGATTAGGGACGTCCAGAGAAGTTTGAAAGCGATAAATAAGGTTTTTATAAAGATAAAGGCACACACTTCAAGTATATATTCCATGTTTAGACAAGAAAATTATAAAAGAGTGAATATAGTTTTACTAGAAAAAGTAAAAGAATTAGGACAAGAACGAAAAAATCTCAGTCAACTTCTAATGGCTGCCATCCAAGAAAATAAAAATATTAGAATGAGATGTCAGTTGGAGAATTTGGCAAAAAATAGACTGGCGCGTCACATCGAAAACACACAAAAGCAAATAAAGGAAAACAAAACGAGATATGTAAATTTCCAGCAATTGTATTTGACAACACATCAAGAAAATATTTTTCTAAAAAGCCGAATGAAAAATCTAACAAGAGAGAAAGACGAAGCGGAAAGGAATCTGATTAAACTTGTGAATCAAGTTTGTCAGTCAAAGAACAACGACCTCAAAGCTTACTGTAGTCGTTTTATTGTTCGCACTAAAGACAATCTATTAAATTCTGATGTACGCTCTGAAATACAACAGTTCCTTCAGAAGCCAGGACAGTCCAGTGATAAGAGTTCTGACATTGCCATAAGCCACAAACTCACTGAATGTCCGAGTAAGAAATTTGAATTAGCAGAAACATTGCAAAACGATGAAAATCTGATCGACTTAGTTGAAGACGCCCCGAAATTGCGAGGTCTACCGGGGGAGTGTGTTTGGACGGTAAAAGACAAAGAAGGAATGATCCAAAAACTTTATGAGTACGAAACTGAGTTTGATAACGGCGACACGATCAGACGAATAAGACAATATTCCGTTTATCATGATAAGGACTGTTTGTTGGATTTCTCTAATTCGACGACGTTTATCAGAAATTCCAACATCGATATTTCTACACTGCGGAAGTCTTTTGTGAACACGGATAGCAAGATAACAAAATGTTACCACCATATGATGTCGGATAATTTAAAAAAGGTAGATCTTAGTTGTATCAGTTGGAAAGTTATTAAAAATGATGGATTGGATCTTGAATATACTGTTCCAATACCAAGATCAATAGCTGACGGCATCCTGATGGAGCTCGAAGAAACTTTATGCTTTTTTACAGGAGATCTAGCCCAGATTAAAGTTTTCGGTAAAATTTATCCCCTGCCTCGACAGCAAGTGGCCTATGGTGACCCTGGAATTACATACACCTACTCCGGCATAACCGTACCAGCCTTACCGTGGCCTAAACCTGTATTAATGCTGCGAGATTTTTTGTTTACTTTGAAAGGCATAAAATATGAGTTTGTCTTAGTCAATAAGTGA

Protein sequence:

>DPOGS206728-PA
MYIAFKWLMVMALAMWLCLCFVDIYQNKIYDLINIEFKVKSISAPMFKNLSAKQQQPYKRKETLNLRIKNGDNNLFQYVMFVAGACMVLIYNKKNHIILYSTETVRRINIILLKKIIVIIKESIPFFTGTVIRDVQRSLKAINKVFIKIKAHTSSIYSMFRQENYKRVNIVLLEKVKELGQERKNLSQLLMAAIQENKNIRMRCQLENLAKNRLARHIENTQKQIKENKTRYVNFQQLYLTTHQENIFLKSRMKNLTREKDEAERNLIKLVNQVCQSKNNDLKAYCSRFIVRTKDNLLNSDVRSEIQQFLQKPGQSSDKSSDIAISHKLTECPSKKFELAETLQNDENLIDLVEDAPKLRGLPGECVWTVKDKEGMIQKLYEYETEFDNGDTIRRIRQYSVYHDKDCLLDFSNSTTFIRNSNIDISTLRKSFVNTDSKITKCYHHMMSDNLKKVDLSCISWKVIKNDGLDLEYTVPIPRSIADGILMELEETLCFFTGDLAQIKVFGKIYPLPRQQVAYGDPGITYTYSGITVPALPWPKPVLMLRDFLFTLKGIKYEFVLVNK-