Monarch geneset OGS2.0

DPOGS203268
TranscriptDPOGS203268-TA1875 bp
ProteinDPOGS203268-PA624 aa
Genomic positionDPSCF300229 + 61659-65362
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0153580.087.40% 
BombyxBGIBMGA000448-TA0.089.93% 
DrosophilaCG10217-PB0.070.89% 
EBI UniRef50UniRef50_E3X9670.071.03%Putative uncharacterized protein n=10 Tax=Neoptera RepID=E3X967_ANODA
NCBI RefSeqXP_395969.30.080.82%PREDICTED: similar to CG10217-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3800143550.075.65%PREDICTED: uncharacterized protein LOC100871601 [Apis florea]
NCBI nr blastxgi|3800143550.075.65%PREDICTED: uncharacterized protein LOC100871601 [Apis florea]
Group
KEGG pathwaynvi:1001143262e-11 
 K12386 (CTNS)maps-> Lysosome
Orthology groupMCL14700 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203268-TA
ATGGTTCTTAAATCAATTTTAGTTTTAGTTACTTGTGTGATATTTATTTTACATGGACTTACGTCGGTTTCGGCGGAATGCACAATTCCAGTGGTTCTTCGAAATACTTGGTTTTCTTTTGAAAATGGAAAGCAAACGATCACTGATATAAATGCCAATGAAATGACTGGAAGGGGAATATGTATTAATATTAAAGCAGATTATCATGTAAACTACACAATGGTTTTTCAACACTCCAAGTGCTATTACTGTGTTAAGTTAATAGTTCGAACTGTGAATGTTTTGGAGAAAATTGAAACACCTTGTGTTGATTTGCCACCTGATCAAGAGCCTACAGTAGAAAGAGTGTGCAAAGGGTTTAAACCTGATCAATCTCTGATAACATTGTTCTCGGAGAATTCAGTTCCAGTTAACTGTAGGTCTTCACTTGAAGGTGTATGGCAATTTGCTTATCAGAATCGTTTCCGATTCACTGGTGAGTGTAACCACCCAGGTGCTCAGATTAAATCGTGTCAAACAGCTGGGACTCAGTTTCTTATAACTAACCAGAAGTTCAATATAACTTATAAGGAATGTCCCGGTATGTCTGGTACTTTTGAAGGTGTAGTTGAATTCAGCTGTCTAGGACATTGGTTTGTCGATAAGAACCACTTCTTTGCTGTGGCGAATACAAAGGAGTCACGTAAGGATGAAAGATACCGTTGCTTCCTTAAGAATCGGGACGATGACCTGTATATTGGTGCGTCCATAACACCTCAATGCAACACTTTGAAAACTGTCGAAAAGTCGCCGGAGAGATACAGAATAACACCAGTGAAGGCAGAAGTAGTGGAACCAGGTTGCCGTTTGCCTCAAAACTTTTCCGGAGACTGGATCAATACAGCAAATATTGATGCTGATGTGTTCATCAACGAGACTCACATCATTGAAACTTATTATCCAGATGAGGGGAGATACAGAAGGACAATATATGTGTGCAAAGAGCAACGTGACAGTCGTGTTATGATGGCCCGGCTTACAGTTGATGGTTGTCAAAAAGATTACGTCTGTTTTGACTTTGTACCTCAACATCATAATATCATAAGATATCGTAAAGGCCTAGCCATGATACAAAGTAATTTCCACACAGTCTGCTCATGGGTACAATTTCCGAACAAACAGAAATGGCGTTACGATTTATTCTTGAAGAGAGATCCCTCACCTATAAGATGTCCTGTTGCCGGTAAATTTAACTTTACACAAAGAGGAGACGTCAAATTTGAGACTAGAATACTCGGTGGAGTAACTTTGAGTCCACGTCCGAACTTGTACTGCAAACTGAACATAAGTGACTTTTCTGTATGCGATGTAGATCAGAAGACCATACAAATAAAAGAGAATTATTGCTTAACCGTGGACCATTTGGGTCGACCAGTGGATATTTACAGTGACCCAGATTATAAAATGAAATGTATCGGATATTGGAAGGAGAATTTGAAGTCTTATTTGATCACATACGACGAATTGGATCCCTTCTCAAAATATAGATGTTGGGTTTACCAAAGAGCTGATCTCAACAGAGTTCTTATGTCTCAAGCTCTGGGTCCGTACTGCGATTTGAAGCAAGATGTAACATCATGGAATTACACTGAGGGTGCCGCTGTGGCTATTGAAATGGAAGAATATGAGAGGGAGAGGGATCAATGTCCTATGCATTTCGATGATGGTAGTGACCCCTGGTCAACCAAAGAAAATTATATTAAGGTGTTTAACTTTGCTTACTCATTTTGGAGAAGCAATGGTGCAGCCACCATAACAATGTTTTTACCTCTCACAGCTTTAGTTTTTGGTATAAATATTTGGAAGAATCTTAATATTTTCTGTAGGTTAATGTAG

Protein sequence:

>DPOGS203268-PA
MVLKSILVLVTCVIFILHGLTSVSAECTIPVVLRNTWFSFENGKQTITDINANEMTGRGICINIKADYHVNYTMVFQHSKCYYCVKLIVRTVNVLEKIETPCVDLPPDQEPTVERVCKGFKPDQSLITLFSENSVPVNCRSSLEGVWQFAYQNRFRFTGECNHPGAQIKSCQTAGTQFLITNQKFNITYKECPGMSGTFEGVVEFSCLGHWFVDKNHFFAVANTKESRKDERYRCFLKNRDDDLYIGASITPQCNTLKTVEKSPERYRITPVKAEVVEPGCRLPQNFSGDWINTANIDADVFINETHIIETYYPDEGRYRRTIYVCKEQRDSRVMMARLTVDGCQKDYVCFDFVPQHHNIIRYRKGLAMIQSNFHTVCSWVQFPNKQKWRYDLFLKRDPSPIRCPVAGKFNFTQRGDVKFETRILGGVTLSPRPNLYCKLNISDFSVCDVDQKTIQIKENYCLTVDHLGRPVDIYSDPDYKMKCIGYWKENLKSYLITYDELDPFSKYRCWVYQRADLNRVLMSQALGPYCDLKQDVTSWNYTEGAAVAIEMEEYERERDQCPMHFDDGSDPWSTKENYIKVFNFAYSFWRSNGAATITMFLPLTALVFGINIWKNLNIFCRLM-