Monarch geneset OGS2.0

DPOGS203550
Transcript: DPOGS203550-TA, 1314 bp
Protein: DPOGS203550-PA, 437 aa
Genomic position: DPSCF300055 + 476041-489025
RNAseq coverage: 380x (Rank: top 31%)
Annotation
Heliconius      HMEL004107        2e-72   63.03%
Bombyx          BGIBMGA004247-TA  2e-109  63.31%
Drosophila      CG34348-PA        7e-76   48.47%
EBI UniRef50    UniRef50_Q9VZ05   1e-73   48.47%  CG34348 n=11 Tax=Pancrustacea RepID=Q9VZ05_DROME
NCBI RefSeq     XP_002106669.1    7e-75   48.81%  GD17011 [Drosophila simulans]
NCBI nr blastp  gi|328792903      9e-78   47.73%  PREDICTED: transmembrane protein 68-like [Apis mellifera]
NCBI nr blastx  gi|383861719      1e-78   49.50%  PREDICTED: transmembrane protein 68-like [Megachile rotundata]
Group
KEGG pathway 
Orthology group: MCL14967 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203550-TA
ATGGTTGAGAATCAGTGCAAAGCCAGGCCAGTAAAAGCCTTTATAATGAGTGATCTTCTTACAAAATTCGATTCGGACGCTACCAACGACATCGTCAAAGTGGCTGTTGTTGTTGTATTTTGTTTGTTATATCCTCAAAAACCAAATTCCGTGTCTGCAGCCGATTGTATTGTATCAAACAAGTTAAGCCACATCAAGAGCGGTTCCATCTTGATGGGTGGGCGAACTAATTTTGGTCAGAGGATATCTCACGAGGTCAGTCGTAAAGACTCACGCTCACGGTCTAGACCATACAGTCTAGACGTCTCCGAGACCGCGTTATCACCCGGATTTCCTTCATCGATACACATCCATGGAATAAAAAAAACTGATAAGGACCATCGACACAGCATCCGGGAGGATGTGGAGTATGTGGACACGGAGTACAGTCTCTGGCTGAGCTGGTTCCTGACTCCCGTCATCGTGACCTTCCTTCTACCAGCTGTTATCATCGTCCTCATCTATGTCAGCAGCATCATCTTCCATCTATACAGGCTTTACAGGCTTCGTGTGGTAGACGGAGTTCACAATGACTGGCGACACGCGGCTAGACTTGCTGTGTGTGCGTTGTGGGATGCCCACGGCTGGCTGTGGCATGGCTGGTCCTCTCTTCTCGAGGGTCTGTGTGTGATCCCGGGGACAGTTCAGACTTGCGCGGGGGTTCTAAGGTCAGGTAACTCGCTGGCCATCTCCCCCGGAGGAGTGTATGAGGCCCAGTTTGGGGACCACTACTATAAACTGAATTGGAAGTCCAGGATAGGATTCGCCAAAGTCGCCCTGGAAGCTAAAGTCCCGATAGTACCCATGTTCACTCAGAACGTCCGCGAGGCGTTCCGTACTGTGGGCTGGTTGCGAGGGATATGTCTCCGTATTTACGCGGCTACCCGTGTACCTCTCGCGCCCGTCTACGGGGGCTTCCCCGTCAAACTGGTCACGCACGTGGGTACGCCCATCCCGTACGACGCTTCTCTCACTCCGGAGACACTTCAAGTGAAGGTCGCGAGTGCCATAGAAGACTTGGTTGAGGAACATCAGCGGGTCCCCGGGAGTATTCTGCTCGCCCTCATCGAGAGGGTGTACGAGATGCCAAAAAAGAAGAAGGTTCAAACGAACGGCTCATGTCACAACGGCGTCGCAAAACAGAACAATTTCCCTGACAAGTGTGACACTGACAGGAGTGACGTCGACCCTGTCAAAAGTGACGTCGACCCTGTCAAAAGTGACGTTGACAGCGGCAAGTGTGACAGTGACAGGAGCGACAAGAAAGTATCATAG

Protein sequence:

>DPOGS203550-PA
MVENQCKARPVKAFIMSDLLTKFDSDATNDIVKVAVVVVFCLLYPQKPNSVSAADCIVSNKLSHIKSGSILMGGRTNFGQRISHEVSRKDSRSRSRPYSLDVSETALSPGFPSSIHIHGIKKTDKDHRHSIREDVEYVDTEYSLWLSWFLTPVIVTFLLPAVIIVLIYVSSIIFHLYRLYRLRVVDGVHNDWRHAARLAVCALWDAHGWLWHGWSSLLEGLCVIPGTVQTCAGVLRSGNSLAISPGGVYEAQFGDHYYKLNWKSRIGFAKVALEAKVPIVPMFTQNVREAFRTVGWLRGICLRIYAATRVPLAPVYGGFPVKLVTHVGTPIPYDASLTPETLQVKVASAIEDLVEEHQRVPGSILLALIERVYEMPKKKKVQTNGSCHNGVAKQNNFPDKCDTDRSDVDPVKSDVDPVKSDVDSGKCDSDRSDKKVS-
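
The transcript and protein lengths in this record agree with each other: 437 residues plus one stop codon is 438 codons, or 1314 bp. The sketch below shows one way to check that relationship and to translate short stretches of the transcript. It assumes the standard genetic code applies to this nuclear gene; the `translate` function and the constant names are illustrative and not part of the record. Only the first eight codons and the last four codons of DPOGS203550-TA are copied in, and the stop codon is written as `-` to mirror the trailing `-` in the protein sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
# Assumption: the standard code is appropriate for this transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (illustrative helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # The table marks stop codons with "*"; the record uses "-".
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 24 nt and last 12 nt of DPOGS203550-TA, copied from the record.
first_residues = translate("ATGGTTGAGAATCAGTGCAAAGCC")
last_residues = translate("AAAGTATCATAG")

# 437 residues plus one stop codon, three nucleotides each.
expected_bp = (437 + 1) * 3

print(first_residues, last_residues, expected_bp)
```

Passing the full nucleotide sequence to the same function would allow a comparison against the complete DPOGS203550-PA entry, under the same assumption about the genetic code.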