Monarch geneset OGS2.0

DPOGS211727
Transcript: DPOGS211727-TA (2223 bp)
Protein: DPOGS211727-PA (740 aa)
Genomic position: DPSCF300239 + 91694-107997
RNAseq coverage: 3075x (Rank: top 4%)
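
The fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the genomic position reads as scaffold, strand, then 1-based inclusive start-end coordinates (the record does not state this), and the helper names `parse_position` and `span_length` are hypothetical, not part of any database tooling.

```python
import re


def parse_position(text):
    """Split a position string into (scaffold, strand, start, end).

    Assumes the layout 'SCAFFOLD STRAND START-END'; this is an
    interpretation of the record, not a documented format.
    """
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, strand, start, end = parse_position("DPSCF300239 + 91694-107997")
genomic_span = span_length(start, end)

# A 2223 bp coding sequence holds 741 codons: 740 residues plus one stop codon.
transcript_bp = 2223
protein_aa = 740
lengths_agree = transcript_bp == 3 * (protein_aa + 1)
```

Under these assumptions the gene spans 16304 bp on the scaffold, considerably longer than the 2223 bp transcript, which would be consistent with intronic sequence in the gene model.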
Annotation
Heliconius: HMEL017351; 4e-64; 48.58%
Bombyx: BGIBMGA013974-TA; 6e-79; 42.81%
Drosophila: lig-PD; 8e-20; 38.69%
EBI UniRef50: UniRef50_E0VHH8; 2e-20; 36.57%; Lingerer, putative n=1 Tax=Pediculus humanus corporis RepID=E0VHH8_PEDHC
NCBI RefSeq: XP_002425572.1; 3e-21; 36.57%; lingerer, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|332020388; 1e-20; 31.62%; Protein lingerer [Acromyrmex echinatior]
NCBI nr blastx: gi|328789075; 5e-34; 28.03%; PREDICTED: hypothetical protein LOC413433 [Apis mellifera]
Group
Gene Ontology: GO:0005515; 2.5e-20; protein binding
KEGG pathway
InterPro domain: [25-118] IPR009060; 2.5e-20; UBA-like
InterPro domain: [242-269] IPR022166; 1.2e-07; Protein of unknown function DUF3697, ubiquitin-associated protein 2
Orthology group: MCL26155; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211727-TA
ATGAGTTTGGGTGCTCGCGCCACCAAGGGTGGTGTGAAGAAAGACGGGAAGCAGGACAAAGGAAAGGTGGCCGAGAAACCTGCGCCCAAGGAAAAGACTAAACCACAGGCAACCACAGAACAGCTAAGGATGGCCAACATGATTGACTGCAAGAGTGAAGACGCCTCCGATGTACGGAGAATGGTGACAGAGTTGATGGAAATGACCTGTCGCACCGAGGAGGAGGTGTGCTCGGCCCTTCATGACTCAGACAACGACCTGCAGGCTGCCTGCAACCTGCTGCTGGAGGAGAGCCAGCGGATACAGGGCGAGTGGCAGACCAGCGAGAAGAAGAAGAAGAAACCTTCTCAGCCAGCCGGCAATGGCGATGCTGAACGGGAGAGAGATGCTCGCAGCAGATCGGGGCCTCGATCACGCCGTGGCGAGGGCGAGTCCAACCAGCCCCCCGGCGGCGGTCGGGGGCGAGGAGCCGGGCGAGGGAGGGGTGGCCACACACATAAACAGATTTGTGGTAACATTGATTCGTTCCCCACCACAGAGGATTGGGACAATGAGGAATGGTCGGGCTCTCTATCGGATACTAAGGTGTTTACTCCCAGCTCCAACGCACTTCCGGCGGAAGCCGAGCCGCCGGCTGTTACGGAGGATTGGGAGTCTGGAGAGGCGAACGGCCGCGAACACATACCGACCTACACATACCACAACACACACCAACACATACCGATACCATCGTCGGCGGTGGAGATGCCAGGTGGTACTGAGACATCGAGCAATATGTTCATCGACGTACAGTTTGGAGCCCTGGAACCCGACGCCCTACACGAGCCTCCGCCCCCCGTGCAGGGGGCGCCGCCTCCCACCACCACGCAATCACCTACAACCAGCATACAGACTAACAAACAGGTCACGTCACAAGCTGACAGCCAGCCCGTACCAAGCCACGAGACGACGACGGAGGCGCCCAGCAATAGTTATCCTAATGCCAATAGTATGTTAAGTGAGAGTATATCAGTAGTGGACCCTACACCCGCCAGCCAGCCGGTCACCTCGGACACAGCGGTCACTAACAGTAGCTCGTTGGATAAGTTGTCGGCGTCAATGAAGCAGCTAGGCGTGTCGGGGACGGCCAGCCCGGCCGCACAGGACTCGCACCGACATCACTACACACACCACCGACCGAAGTCTGTAGGTCAACAACAGAACGTGTACGCGCCTCACCACGTGTCGCACCACCCGCCCGTGTACGGAGCTAATGTGTACGCGCCGCAGTATGACAGTTCAGTGGCGTCCAGCAACAGCTTAACGAGCACCTCCACGACACAGACGTCCACGGCGAAGGTTACCACAACCACAGTGAGCAACGCGGCGAGTACGGGTGCGTCGTCAGCCACGAGCGCAGCGGGCGGCGGAGCGGGAGCGGGCGCGGGCGCGGCGTACGCGGGCGGAGCGTTGTACGGCGGAGCGTACGCTTACGATGAACAAATCATGAGAGGAACGCTACCGCACCATATGGGCGGATACTACGAGGTCGGCTACGGAGCTCGCGAAGGCACGTTCGGTCTGGGAGCGGGAGAGAGATTCGGTAGAACGGACGCCGCCTCACCACAACAGTACAGAGATTTTAACGTAGTTCCAAAGATCACTGTGATGAGCGGCGAGCAGAAGCCGTTACATCCGAGACTGGTCCCGGCCGCCTTACCTCCCGGATACGCGTACTTCTATCAGCCACCGCCTACGTCCTATCAGTACGGAGTTTATCCACCTTACGGTGGTGGCTCAAGCGTGGGCGGCGTGGGAGGCGTAGGTGGCGTGGGCGGCGTAGGCGGAGTGGGCGGCGTGGGCGGTGTGGGCGTGTCAGGAGGTGGAAAAGTATCCGCTTACTCACAACAACAACAACCGCCCTACGACGGCCAGGACACGTATAAACCAGGCGGGCCTTACACGGGTAGTGCGAATAAGACGGGCGGAGGCGCCAATGATCTCACCAACACAATGTATGC
CAAGACTCATGTAGCTCTCAACAAGGTCAATGCTGGTTTTCATAGCGGCACCCCGCCACCGTTCGGCGCTGGGTCACACCTGTACATACCGGCCCCTCCGCACCACCATCCCCACCACCACGCTCCGCAACAACACCAGGAGAGCGGATCCGCGGGTCGTTCCTCGTCTAAACCACAGTCGAGCAAGCCGACGTATTCCCAATCATACTGGACGCCGAACTAG

Protein sequence:

>DPOGS211727-PA
MSLGARATKGGVKKDGKQDKGKVAEKPAPKEKTKPQATTEQLRMANMIDCKSEDASDVRRMVTELMEMTCRTEEEVCSALHDSDNDLQAACNLLLEESQRIQGEWQTSEKKKKKPSQPAGNGDAERERDARSRSGPRSRRGEGESNQPPGGGRGRGAGRGRGGHTHKQICGNIDSFPTTEDWDNEEWSGSLSDTKVFTPSSNALPAEAEPPAVTEDWESGEANGREHIPTYTYHNTHQHIPIPSSAVEMPGGTETSSNMFIDVQFGALEPDALHEPPPPVQGAPPPTTTQSPTTSIQTNKQVTSQADSQPVPSHETTTEAPSNSYPNANSMLSESISVVDPTPASQPVTSDTAVTNSSSLDKLSASMKQLGVSGTASPAAQDSHRHHYTHHRPKSVGQQQNVYAPHHVSHHPPVYGANVYAPQYDSSVASSNSLTSTSTTQTSTAKVTTTTVSNAASTGASSATSAAGGGAGAGAGAAYAGGALYGGAYAYDEQIMRGTLPHHMGGYYEVGYGAREGTFGLGAGERFGRTDAASPQQYRDFNVVPKITVMSGEQKPLHPRLVPAALPPGYAYFYQPPPTSYQYGVYPPYGGGSSVGGVGGVGGVGGVGGVGGVGGVGVSGGGKVSAYSQQQQPPYDGQDTYKPGGPYTGSANKTGGGANDLTNTMYAKTHVALNKVNAGFHSGTPPPFGAGSHLYIPAPPHHHPHHHAPQQHQESGSAGRSSSKPQSSKPTYSQSYWTPN-
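
The two sequences above should agree under the standard genetic code. The following is a minimal sketch of that check, not the pipeline used to build the geneset: the function names are hypothetical, and writing the stop codon as "-" is an assumption made to match the trailing "-" in the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def read_fasta_sequence(lines):
    """Join all non-header lines of a single FASTA record into one string."""
    return "".join(
        line.strip() for line in lines if not line.startswith(">")
    ).upper()


def translate(nucleotides, stop_symbol="-"):
    """Translate in frame 1; unknown codons become 'X', stops become stop_symbol."""
    usable = len(nucleotides) - len(nucleotides) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(nucleotides[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last five codons of DPOGS211727-TA, copied from the record.
start_of_cds = translate("ATGAGTTTGGGTGCTCGCGCC")
end_of_cds = translate(read_fasta_sequence([">tail", "TGGACGCCG", "AACTAG"]))
```

Because `read_fasta_sequence` joins every non-header line, a sequence wrapped over several lines, as the nucleotide record is here, is handled the same way as a single-line one. Passing the full DPOGS211727-TA record through both functions and comparing the result with DPOGS211727-PA gives a quick consistency check.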