Monarch geneset OGS2.0

DPOGS211162
Transcript: DPOGS211162-TA, 1278 bp
Protein: DPOGS211162-PA, 425 aa
Genomic position: DPSCF300007 + 152938-155994
RNAseq coverage: 45x (Rank: top 71%)
Annotation
Heliconius      HMEL017207        5e-177  79.86%
Bombyx          BGIBMGA003147-TA  3e-94   79.02%
Drosophila      CG4133-PA         9e-87   40.68%
EBI UniRef50    UniRef50_E0VY63   1e-108  44.70%  Putative uncharacterized protein n=3 Tax=Neoptera RepID=E0VY63_PEDHC
NCBI RefSeq     XP_002431057.1    2e-109  44.70%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp  gi|270007090      3e-116  47.31%  hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
NCBI nr blastx  gi|270007090      9e-114  47.42%  hypothetical protein TcasGA2_TC013541 [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL16376 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211162-TA
ATGAGGACCACGTCGTTTACACTCGAGGTTTTGGTCCTACTTTTTCTCAACCTGATCCCTTGGCCGGGACCATCCGTCTTCAAAATCACATGTTCTCGTGACAATTCCAAAATCGTAAGAAAAGTTATAGAAAACAAGTGGTTACCGGTGTTGGAAAGGTTTCAGGTAAAACTGCCGTTGGAATGTCCATTTCATCCGGCTCGAGATATCTTTGCGCCACAGCACGCCGCCAAGCAGCAACATCGCTCTTCGCAATGGACATGCGCCTTCTGCGGCAAGTCTTTCTACGAGGAGAGACATCTCGATACTCACTTCGATCAGCGACACCGGCACCAGATTAATAAGGCGGAAGACGCAGTATGTCTTGCGGACTACTGCGATATAATGCGCTGCCAGGTTTTGGTTGCGCATGGTCTGCTGAGCCGGGGCAGCGGACCCGTCACTGAAGTTGAGCTGTGGCAGGACGCTGAGGTCGCCAAGAAAGCCCTAGCCCCGGCCGCCTCGAGGAGCGTTGCAAGAGTTACCAACAGAAGAAGACAACGCACTCGTCCTCCCACTACTGTTGCCGACTGCCCTAAATCGGATCATGATAGCGAGACAGGTACACATACTCCCACTATAGAAACCGAAGATAAAAAAAGAGAAGAGAATGAAGGAGATACGAATCAAACAGCAGAATTATGTGACGCTGACACGGTAGAGTCTTCGCTGCCGCCAGATAGTCGTCAGAAGAGGTTGGCAGATTTACAGGCAGAAAGAGCTGCCTGTGATCCAACGCACTTGCAAGGCCTTAAGGTCCGCTGTGAGAGGGTCGTGCATTCCTGCATAGCAACATTATTGTTGCATCTAACGCAGCATCAGTTTTCAGAACTTGAAGAGGAGATGCAGCGTGCGGTATGTTGGTACTTGAGCTGTGACCGCTACTGGGAGGACACCGCGCCGGCTGCGCGTGCCTTCCCCTGGCCCCTTCTCCTCGCGCTGGCCACAGGTCTAGCACTCGCACTCTGCATGTGCTACTACATCATATGGATCGTCTTTGATTCGGAAGAAGGTAGTATAGCTGGTAGTGGTTCGGTGTCGATGACGACTCACTCGTCGCCAGCGCGCGGTGCTGATGACAGCTTGTTGGATGAAGATCACGATCACGATCACGACGACCACTACATATATGTGACTTACCCGCCGGAACTGAAGCGTCGACTGCTCGAGAGGTACGCCAGAAACCATCTCACACTTATCATACTCTCGCCGTATATCAGACAGACCAGGGTCGATTAA

Protein sequence:

>DPOGS211162-PA
MRTTSFTLEVLVLLFLNLIPWPGPSVFKITCSRDNSKIVRKVIENKWLPVLERFQVKLPLECPFHPARDIFAPQHAAKQQHRSSQWTCAFCGKSFYEERHLDTHFDQRHRHQINKAEDAVCLADYCDIMRCQVLVAHGLLSRGSGPVTEVELWQDAEVAKKALAPAASRSVARVTNRRRQRTRPPTTVADCPKSDHDSETGTHTPTIETEDKKREENEGDTNQTAELCDADTVESSLPPDSRQKRLADLQAERAACDPTHLQGLKVRCERVVHSCIATLLLHLTQHQFSELEEEMQRAVCWYLSCDRYWEDTAPAARAFPWPLLLALATGLALALCMCYYIIWIVFDSEEGSIAGSGSVSMTTHSSPARGADDSLLDEDHDHDHDDHYIYVTYPPELKRRLLERYARNHLTLIILSPYIRQTRVD-
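
The listed lengths are consistent with each other: 1278 bp is 426 codons, that is, 425 amino acids plus one stop codon. A minimal Python sketch of how the transcript could be translated to check it against the protein record follows. It assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon; the function name `translate` and the `stop_symbol` parameter are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here to
    match the convention seen at the end of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six codons of DPOGS211162-TA followed by its TAA stop codon.
print(translate("ATGAGGACCACGTCGTTTTAA"))
# Last six codons of DPOGS211162-TA, including the stop.
print(translate("CAGACCAGGGTCGATTAA"))
```

Passing the full DPOGS211162-TA sequence to `translate` and comparing the result with the DPOGS211162-PA record would confirm that the two entries agree over their whole length.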