Monarch geneset OGS2.0

DPOGS207266
TranscriptDPOGS207266-TA1236 bp
ProteinDPOGS207266-PA411 aa
Genomic positionDPSCF300008 - 660210-665292
RNAseq coverage495x (Rank: top 25%)
Annotation
HeliconiusHMEL0163334e-5992.20% 
BombyxBGIBMGA012130-TA1e-13877.45% 
DrosophilaCG5466-PA4e-6368.89% 
EBI UniRef50UniRef50_E9IPN22e-11755.94%Putative uncharacterized protein (Fragment) n=11 Tax=Neoptera RepID=E9IPN2_SOLIN
NCBI RefSeqXP_623560.22e-11453.60%PREDICTED: similar to CG5466-PA [Apis mellifera]
NCBI nr blastpgi|3227943746e-11755.94%hypothetical protein SINV_10773 [Solenopsis invicta]
NCBI nr blastxgi|3227943743e-11456.22%hypothetical protein SINV_10773 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain[77-188] IPR0222071.9e-29Protein of unknown function DUF3736
Orthology groupMCL16569 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207266-TA
ATGGATTTAGCGGAGACCATGAAAAGTGTTTTGGCTAAAAGTACGGGAAATGCAAGTGGTGCTTCAAGTAGTACTAGTGCGGCCGCCCCGCATGGCGCTGAAGGCTATGTGACAGAAAAACTTTACATGTTATTGCAGTTGTACTTGCAAAACAAAGGATGGAGTCCTAGTATAGAATTATTACAGTGCTTTAGTGATCTCAAGGAGTCTTCAATGTTGCCAAGTGCAGCATACTTACAAATGATGGCTTCAAGGGTGGGTCTTGACGCTCAAGGTAGACTTATATACAGAGAGAATGGTAAAATTATATTGCCATATGAACATTTTGCTAATGCAGTTATGCTTAAACATATGAATGGTCCCCACGGTTTGCATCTCGGCTTGGAAGCGACAGTGAGAGCTGTTGTGGAATCATATACCATCGGCCGCGATCAATTTGGAATGGAAAAGGAATTTATTGTTGAGGTTGTCCAAAACTGTCCGAACCCTGCCTGTCGTTATTATAAGAGCCAGTTGGAAATGACACAGAAATCTATACAGCAACATTTGTCACAACAACCCACATATATACCAGAAGCGGGCGGTGCTGCTCTTCCGGACAACGCGGCGGTGGAACGTTTGCTTCGAGGGGCCGCCCTGCCTCCTGACTATCAGCTGCCACTGGCACCACACCTCGCCGCTCTCACAAACCTCGCCGGTGATAAGTCAGAGCGTCGTGGTGATAAGATGTCAACGCAGCAGATGCTGTTACATCAGAACCGTTCCCTCGGCCACGGGGACAAGTACTCACACTCGAGGACTCACTCCGGAGACAACAAGAACAAACATCACTATGATGAACATAAACTTACTGACTTTTTGAGAGCCAATCTAGAGAGTTTGGAGTCTCTATCGAGCGGTAGTCACGAGCGCGGTTCGGGTGCGTCGGGTGGTGGCGCCAGTGTGAGCGGTAGCGGTAGTGGGAGCGGAAGCGGAGGTGGAGGCGGACAAGAGCGTGTGGTGCGAGCGTTCGCGGAATTGGCGCGGAATTTACAGCGGATGAGACCATGTGTACGACCAGCCATGTGCAAACCGTATGGGAAGCAAATTCTACTGGACACCATTCAGCTGGTGCAATCTCTCCGGAGCTACCTGCCTCCCCCACACATACAAGTCACCTCGTGGAAGAATGATGACAAATTACGTTCCAACAACATGGATGACCTCGAGAGTCGTAAGATAGTGAGCGGCAACTGA

Protein sequence:

>DPOGS207266-PA
MDLAETMKSVLAKSTGNASGASSSTSAAAPHGAEGYVTEKLYMLLQLYLQNKGWSPSIELLQCFSDLKESSMLPSAAYLQMMASRVGLDAQGRLIYRENGKIILPYEHFANAVMLKHMNGPHGLHLGLEATVRAVVESYTIGRDQFGMEKEFIVEVVQNCPNPACRYYKSQLEMTQKSIQQHLSQQPTYIPEAGGAALPDNAAVERLLRGAALPPDYQLPLAPHLAALTNLAGDKSERRGDKMSTQQMLLHQNRSLGHGDKYSHSRTHSGDNKNKHHYDEHKLTDFLRANLESLESLSSGSHERGSGASGGGASVSGSGSGSGSGGGGGQERVVRAFAELARNLQRMRPCVRPAMCKPYGKQILLDTIQLVQSLRSYLPPPHIQVTSWKNDDKLRSNNMDDLESRKIVSGN-