Monarch geneset OGS2.0

DPOGS215583
Transcript: DPOGS215583-TA, 2757 bp
Protein: DPOGS215583-PA, 918 aa
Genomic position: DPSCF300097 + 155399-159972
RNAseq coverage: 280x (Rank: top 39%)
Annotation
Heliconius: (no entry)
Bombyx: BGIBMGA009188-TA, E-value 1e-27, identity 36.47%
Drosophila: (no entry)
EBI UniRef50: UniRef50_D6W9C7, E-value 3e-21, identity 22.42%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9C7_TRICA
NCBI RefSeq: XP_001660842.1, E-value 5e-23, identity 26.53%, hypothetical protein AaeL_AAEL010459 [Aedes aegypti]
NCBI nr blastp: gi|157126193, E-value 1e-21, identity 26.53%, hypothetical protein AaeL_AAEL010459 [Aedes aegypti]
NCBI nr blastx: gi|72547036, E-value 2e-39, identity 25.38%, proteophosphoglycan 5 [Leishmania major strain Friedlin]
Group
KEGG pathway: (no entry)
Orthology group: MCL18015 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215583-TA
ATGGCTTTGTTCCCCGCGTATTCATCGGACGCTGTAGAAAATACTACTATAAAGGAGCGAGTGTACATATCAGAAAACCCATCTCAATTTATCGAGAGCGAGTTGTTGGCGAGTGATTCTGAAGAGAGTAAGGACGAGCCTGCTGAGAAGTTGTCCCGTGTATTTGTCAGAGGTCCCCGGCCTCAGAGTAATGATGACTTCTACATTGATCGCAAACTCGATCGCGGTAATCTTCGGGTCTCCACCCTCTACTACCCTGGCAGACCGCATTTGGGCTGTCCTAAGCGCTCCGCCGGTCACACGTGTTCGCGTCACGTCTCACCGACCTCACACGTCGCCAGGTACGAGTGTAAGGTCCGCCGATTCTCCGAGTCCAGGGGCGAGGCTCGCAGCCGGAGGTACTTCACCCGCCGCGTCCCAGACTCCGACCCGGATCTGACAGACCTGCAAGAGAGAGCCGCCGCCTACCGCGACCTGCTGCAACGAAACCCCACCGACGTTGCCCTCTGGGAGACCTTCATCGACTTCCAGGAGCGGTGCGGCGGACAGGACGCGGCGCTGAGGGCGGTGGAGGAGGCGGTCGGCCGCGCCCCGAACAGCGCCCGCCTCCAGGGGCGGCTGCTGGAGGCTCTGCGCGCCGCTCTCGACCAGCACCAGCTGCTGCAGAGGCTGCGGACCATGCTGGCGGAGGAGCGCTCCGGGGCTGCTCGCGTCGCGCTGTGGGAGGCGCTACTCGACGCCCTGGGGGCGGAGCGCGGCACCGATGCCGCCAACCTCACGACCGCCGCCGCCGCCGCGCTCAGAGACACGCCGCGGGAACACGCGCCGCGCATATTACACGCACTCGGGTGCTACCTTCGAGCCGCTGGTCTGTGGGAGCGTCTCGTGCTCCTGGTCGAGCTGACGGTCGCCATGAACTACGCTCCCGCCGCGGTCTCCGCCCCCGACCCGGCCGTCCTCGCCGAGGCGGAGCGCCGCTCGCTGGAAATGGAGGATCAGGCGATCTCCAGCGGCCTGCCGCTTAGTGCCGTGTGGGTGCGCGTGGAGCGAGCGCGAGCGGCGTGTCACTGGCGGCCCGCGCTCCCGTCCTCCTCGGCGGACCGCTCCCCGCCCCCGGCCGACCCCCAGCGGATCCCGTTGCCGCACGACGTTGCTGACCTGCTGCTGCCGATGTCCGCGGACGACCACCTGTTCCACCTGTCCGTGCGTCTCCTTCTCCTGGTCAAGGTTCCCATGTTGCCCGCCACGGACCTCTGGACTCGCCGAGCGGGAAGGCTCGGTGCAGGCGGCGGCGGCGAGTCTCTCCTGCCGCTGGTGTGGGCGTGTCGGACGCTGCCCCCGGCCCACCCCGCTCGCCCTGCCCCGGAGCTCGCTCGTCGCCTGCTCGCGCTGCTAGTGGACCCTCCTCACTATTTTTCGGACGACACGGGTTATCTCACTTGGGTCAACTCACTGTGGGAGGTGTGTTGTTCGCGGGCGGGAGGGCGCTCGCGGACTGCCCTGGTATGTTGGAGGCTCCGCTGGCTCCAGACGCTGTCTCTCCTCTCGACCGAGGACGAGGCGGAAACGAGACGACTGCGGGCGGAGGGGCGGGCTCTGTTGCGGCGCTTCGAGACCGCCTCTCCTCTACCGTACGCCGAGTGTGCCAGGCTGGACTGGCTGGCGGCGGGAGGGGTCCGGGGCAAGGGGGCGGAGCGCGCACTGCAGGCGGCCGGCCGGGCCCTGAGAGCTGCGCTGGCAGACGACTCTTGTCCTCCTCAACATGCGCTCTTCGTGGCCAGGGTGGTGGACGAGATCGCGGGCGGGACCAGCGACGCCGGGGTCGCCGCCCTCGTGACGGCCGTCACCGGGCGCGACACGCGCGGCGGGGCCTCGGAGGACGAGCGGAGTCACGCGCTGCAACTATGCGAGGAACGCTGCGAGGACATCGAGCGGGGGCTGCTGGCGGCCGGCGAGGAGGAGGGCGAGGGCTCCGGGCCCGACACCTGGGTGGACCTGCT
GCTGCCGGGACACGGCGAGTGGGCCCGGGCCCGGACCGCGCTGGCTGCACCCGCGAGGCGGGCCCAGCTGGTGGAGCGGGTGCGCAGCGCGGCGCCCGCGGCCCGTGGATCCCCCGCCGCCTGCTACTGGGAGGACGCGGCGGAGTCCCTGGGCCGCACGGCGCGAGTCGCGGCTCGCCTCACACCGCTGTTCCCGCACAACGCCGCGCTCGCGGTCGTGTCGGCGGGCGCTCCGCTGTGGTTGTCTCCGGCGGTGGCTCGCGGCAGAGGCCCCCGCGCCGGTGCGGCCGCCTTCGCTTCCTCGCTCCCCGCGTGGCTTGCGGCTCTGCGCACGGACTTCGCCCCGGCCGTTCCTCGCCGTCCTCGCAGTGGGACCGTTCCGTTCCGTGGAGATGTCTCTGATGGATCTTCTTCTGTTCCAGAGGCGGAGGCGCTAGTCCGCGTTTGTCGTCGCCTGTGCTCCGCGTCGGGAGCTGCCACGGACGAGGCGGCGCTGGCCTGGAGCGCCCGCATCGAGGCGGAGGCCCGCGCCCCGAGACCGCGGCTCCCTCACGCCCTGTTCGCCGCCCTGGAGCGCGCGCCGCAGTATAAGTGGCTGTACGTGCGAGGCGGGTCGTGGTGCGGGCGCGAGGCGGCCGTGTTGTCGGACGCCCTCCTGGAGCGCTCGCTGCGAGTCCACGCCCTGCTGGCCGAGTTGGAGCCGGTCCTCACCACACTCCCACGAGACGGTGAGGACGAGCGCCTGGTCCGGGACTAA

Protein sequence:

>DPOGS215583-PA
MALFPAYSSDAVENTTIKERVYISENPSQFIESELLASDSEESKDEPAEKLSRVFVRGPRPQSNDDFYIDRKLDRGNLRVSTLYYPGRPHLGCPKRSAGHTCSRHVSPTSHVARYECKVRRFSESRGEARSRRYFTRRVPDSDPDLTDLQERAAAYRDLLQRNPTDVALWETFIDFQERCGGQDAALRAVEEAVGRAPNSARLQGRLLEALRAALDQHQLLQRLRTMLAEERSGAARVALWEALLDALGAERGTDAANLTTAAAAALRDTPREHAPRILHALGCYLRAAGLWERLVLLVELTVAMNYAPAAVSAPDPAVLAEAERRSLEMEDQAISSGLPLSAVWVRVERARAACHWRPALPSSSADRSPPPADPQRIPLPHDVADLLLPMSADDHLFHLSVRLLLLVKVPMLPATDLWTRRAGRLGAGGGGESLLPLVWACRTLPPAHPARPAPELARRLLALLVDPPHYFSDDTGYLTWVNSLWEVCCSRAGGRSRTALVCWRLRWLQTLSLLSTEDEAETRRLRAEGRALLRRFETASPLPYAECARLDWLAAGGVRGKGAERALQAAGRALRAALADDSCPPQHALFVARVVDEIAGGTSDAGVAALVTAVTGRDTRGGASEDERSHALQLCEERCEDIERGLLAAGEEEGEGSGPDTWVDLLLPGHGEWARARTALAAPARRAQLVERVRSAAPAARGSPAACYWEDAAESLGRTARVAARLTPLFPHNAALAVVSAGAPLWLSPAVARGRGPRAGAAAFASSLPAWLAALRTDFAPAVPRRPRSGTVPFRGDVSDGSSSVPEAEALVRVCRRLCSASGAATDEAALAWSARIEAEARAPRPRLPHALFAALERAPQYKWLYVRGGSWCGREAAVLSDALLERSLRVHALLAELEPVLTTLPRDGEDERLVRD-
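
The lengths reported in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that the transcript holds only coding sequence (no UTRs), that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names `cds_length_bp` and `genomic_span_bp` are hypothetical helpers introduced here.

```python
def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Coding length in bp implied by a protein length (3 bp per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def genomic_span_bp(start: int, end: int) -> int:
    """Span of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record.
transcript_bp = 2757
protein_aa = 918
start, end = 155399, 159972

expected_bp = cds_length_bp(protein_aa)   # 918 codons plus one stop codon
span_bp = genomic_span_bp(start, end)     # length of the gene on the scaffold

print(expected_bp == transcript_bp)       # True
print(span_bp - transcript_bp)            # 1817
```

Under these assumptions the transcript length matches the protein length plus a stop codon, and the genomic interval is 1817 bp longer than the transcript, which would be consistent with intronic sequence.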