Monarch geneset OGS2.0

DPOGS207549
TranscriptDPOGS207549-TA2019 bp
ProteinDPOGS207549-PA672 aa
Genomic positionDPSCF300072 - 1038785-1042927
RNAseq coverage439x (Rank: top 28%)
Annotation
HeliconiusHMEL0205372e-12182.71% 
BombyxBGIBMGA004686-TA0.080.40% 
DrosophilaCG8135-PA0.054.56% 
EBI UniRef50UniRef50_Q8MRQ40.054.56%LMBR1 domain-containing protein 2 homolog n=15 Tax=Arthropoda RepID=LMBD2_DROME
NCBI RefSeqNP_001171929.10.058.47%LMBR1 domain-containing protein 2 homolog [Tribolium castaneum]
NCBI nr blastpgi|3227991850.059.63%hypothetical protein SINV_05916 [Solenopsis invicta]
NCBI nr blastxgi|3227991850.060.06%hypothetical protein SINV_05916 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain[6-538] IPR0068766.1e-102LMBR1-like membrane protein
Orthology groupMCL13482 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207549-TA
ATGGCATACACGTTGTTTGTGGTCGAAATAATATCGGCTTTTATTTTAGCAGCTACCTTATTATATAGATATGGGGATTGTTATAGAAATCACATATTAGTTACAGTATCTGTTCTAATAGCATGGTACTTCTCATTTGTTATCATGTTTATTCTTCCTTTGGATATATCATCGACTGTATATAGACAATGTGATAATGGAACTCATCCCGTCACTGCTGCTACAGTTACTGGGACTGGTAATGGCTCTATTTCAACACCCACCCCTGATACTCAGTGTCAAAAGCCATGGAGTTATGTTCCAGATATTGTCTTTCCAAACCTTTGGAGGGTAGTTTATTGGACTTCTCAATGCTTAACTTGGCTTATCATGCCAATGATGCAATCGTACAGCAAAGCTGGAGATTTTACAGTGAAAGGGAAATTGAAATCGGCTTTAGTTGACAATGCCATTTACTATGGCTCGTATTTGTTTATCTGTGGCATACTTCTCATTTATATTGCCCTTAAACCTGGTGTTTATTTGGATGGTCCAAAAATCAAAGCCATAGCATCCTCTGCAAGCAACACTTGGGGCTTATTCCTGTTGATCCTACTTCTTGGGTACTCTTTAGTAGAAGTACCCAGGAATTTATGGAATAATTCCAAGAAAAATTACACTTTGACCTATAGCTATTTCAAGGTAGCAAAATTGAGTACAGACAAGTGTGAGGCTGAGGAAACAGTTGATGACATATTAGAGTGCTTAAATTCTGTAACAGCAGCAGTAGGGCCCGGCCATCCTTTACATAGTCATGTTGAGACTATAGTACAAAAGTTGCCGATTCAATTAAGGGATAAACTGAATTCCAGATCTCCGCCAGAAAGGCCTACACCACCTTCCCTGAAGTCTCTAGTCAATTTACACAAAAAAACAATAAAGGCCTTGCATGTTTTACAACGAACGGAAACTCAATGGGGTCTCATGATGGAACGTATCTTTCATTTAGAAGATGTCGCATCCAATTCCCGCTCACCAGACAGGAGATTCCAACACACATTCCATACTCCACGACCAAGACTACAGCGGATATTTTATCCTCCTATCATTGAGTGGTACTGGGAATGTTTCCTTCGTCAGTACTTCCTGAAGGCCATGTTTGTAGTGACATGCATATTGTCAGCCGCCGTTGTCTGGTCAGAGTTGACATTCTTCTGTAAAAAACCTGTTCTCTCAATATTTGCTAACATAGTGCTGGCAGCTAAATCAACCTACAATTATGCTTGTATTGTTTCCATATCAACCTTAGTTATCGGTTATATGTTCTACTGTGCATATTCAACGGTACTCAAGATTCGTCTCCTTAACCTCTACTATCTGGCTCCTCACCATCAGACCAACGAATACAGCCTTATATTCTCCGGTATGATGGTCTGCAGACTCACTCCCGCCATGTGCCTTAACTTCCTAAGTTTGGTGCATATGGACTCTCACGTCATCAAAGAGAGGGTCATGGAAACTTATTACACTCAGATCATGGGTCACATGGACGTACTCGGCATTATTGCTGAGGGATTCAACATATACTTCCCGATGCTGGTGGTGTTGCTGTGTCTTGCGACATATTTGTCGCTGGGAAGTCGCTTGTTGTCGCTGTGCGGCTTCCAACAGTTCGTGGGTGACGATGAACTCACAACGGATTTGGTCGATGAGGGACGGGAATTTGTTAAGAGAGAAAAACGCAAACGTCAACGTGCCGAAGAGTCCCTCGCCCGCCGCCGCGACTACAGCGAACGGTTCACGAACAGGAGGGATGAAGAGCACGACAACGCCCGTACCGGGCTCCTCAACGACGTCGATTCAGACTACTACGTGAGCCCCGACAGACACTCCTACCAGCGGGACACCTTCAGACCTGGCGTCGAGGACATCGAACAACGGTTCGGCGAGAGCGCGCTGCCCTCCATCAGGACGGAATACGACGGCCGAAGACGGGACAAGATGACGATGCCTCCTAAAGGATTATTCGACGATGTATAG

Protein sequence:

>DPOGS207549-PA
MAYTLFVVEIISAFILAATLLYRYGDCYRNHILVTVSVLIAWYFSFVIMFILPLDISSTVYRQCDNGTHPVTAATVTGTGNGSISTPTPDTQCQKPWSYVPDIVFPNLWRVVYWTSQCLTWLIMPMMQSYSKAGDFTVKGKLKSALVDNAIYYGSYLFICGILLIYIALKPGVYLDGPKIKAIASSASNTWGLFLLILLLGYSLVEVPRNLWNNSKKNYTLTYSYFKVAKLSTDKCEAEETVDDILECLNSVTAAVGPGHPLHSHVETIVQKLPIQLRDKLNSRSPPERPTPPSLKSLVNLHKKTIKALHVLQRTETQWGLMMERIFHLEDVASNSRSPDRRFQHTFHTPRPRLQRIFYPPIIEWYWECFLRQYFLKAMFVVTCILSAAVVWSELTFFCKKPVLSIFANIVLAAKSTYNYACIVSISTLVIGYMFYCAYSTVLKIRLLNLYYLAPHHQTNEYSLIFSGMMVCRLTPAMCLNFLSLVHMDSHVIKERVMETYYTQIMGHMDVLGIIAEGFNIYFPMLVVLLCLATYLSLGSRLLSLCGFQQFVGDDELTTDLVDEGREFVKREKRKRQRAEESLARRRDYSERFTNRRDEEHDNARTGLLNDVDSDYYVSPDRHSYQRDTFRPGVEDIEQRFGESALPSIRTEYDGRRRDKMTMPPKGLFDDV-