Monarch geneset OGS2.0

DPOGS200686
Transcript: DPOGS200686-TA (1221 bp)
Protein: DPOGS200686-PA (406 aa)
Genomic position: DPSCF300353 - 21664-28337
RNAseq coverage: 931x (Rank: top 14%)
Annotation
Heliconius: HMEL008300, 4e-179, 87.59%
Bombyx: BGIBMGA008917-TA, 1e-161, 78.85%
Drosophila:
EBI UniRef50: UniRef50_D6WXV6, 7e-09, 39.78%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WXV6_TRICA
NCBI RefSeq: XP_001815436.1, 1e-09, 39.78%, PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|189240381, 3e-08, 39.78%, PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|189240381, 5e-08, 38.68%, PREDICTED: hypothetical protein [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL25553 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200686-TA
ATGCTTGTGGGGGAGCAATCGCCGGCACCGCGGGTGGTGGCCGGCTTGCGAAAACTTCGTCCTGAACTGACTCCAGGGCCTGTTATAACGCCTGCTGACCCTAATCTAAGAATATCAGGTGTTCTGCGCCAGTATTACAATGAGGACGTCCATTCGTGGTCGTGGGTCCGCGTGGCCGTCCGTGGTGGGTGCTTGCTTGCGTGGCGCGACGGAACATTGCCGAGACGACCTGCTGCAAGACTCCCCCTGCGGCACCTACACCTTAGGGCTGCCGCAGCTCTCCCCAACGCCTTCCAGCTGTCCAGGCTGCGAGATGACTCCGCAGTTGCCACGTTTCAGGCTTGCAATGCGACGGAATACGCCCGTTGGGTACGCGCCCTGTGCGTCGAAATACTTGGACAGACGCCGCTGCCGCAAGTTAGATTTCTGGACGTACTTCCAGCAGCGGACACGAATAACAAGGAACCAAAGAAAATAACACTACAGGAGACGCCCATGTGTCCCCCCAGGCCTCCTCCCAGAGCTAGACGGAGATTGTTAACCGCGTCAGAGACTCAGCTGTCGAGAAGGGACCACTCGCCCGCTGCAACCGACGAAGGCATAGTCGTGGACCATGACGACTGCGACTCTTCGTCCGACCGCAGCCTCGACCTGTCCCTGTCTCTCGACACACTGAAGACCATCGACGTGGTAGATGCGCCAATAAGGAAAGCAGAGGTCGTTAAATGTGACAATTGCTGTAAACTAACCGCGAGTCCTCCACAGCACCACACGTTACCCCGAGCGAGAACCTCGGACACTGAACTCGGTCGACATCGGTATTTGAAACGCTGGGAGGGTACCACAGCTGGAGCTGAGAGAGGACGGTTCGCGATAGAAGCAGCGAGAAGAAAGACTGCCGCGTTAGAGAACAGAGCCAGGTCATGTTCACCGAACGCTCACGAAGTATCACAGTACGTTCCCGTTCGTGAGAGGCGAGCGCTATTCGAATCACTGTCACAAAGCGGTGGCAGCTTAGCTCGCAGTAGTGAACAGCTGGCTCGGCCGGTTATGGAGACGCCAAGGCGGGCAGCGTCCTTGCACGACCTGCAAGCGCCGCCGACACGTTCTGTTAGCGATTTACGTCAGTTCTTCGAAGCGGTGGCGCGAGGTGTCGGAGCCTGCTCGGGATTGTACACCCCTAATCCTGTCCCTCGCTTCACCTCGCTAGCCTGCGCCTGA

Protein sequence:

>DPOGS200686-PA
MLVGEQSPAPRVVAGLRKLRPELTPGPVITPADPNLRISGVLRQYYNEDVHSWSWVRVAVRGGCLLAWRDGTLPRRPAARLPLRHLHLRAAAALPNAFQLSRLRDDSAVATFQACNATEYARWVRALCVEILGQTPLPQVRFLDVLPAADTNNKEPKKITLQETPMCPPRPPPRARRRLLTASETQLSRRDHSPAATDEGIVVDHDDCDSSSDRSLDLSLSLDTLKTIDVVDAPIRKAEVVKCDNCCKLTASPPQHHTLPRARTSDTELGRHRYLKRWEGTTAGAERGRFAIEAARRKTAALENRARSCSPNAHEVSQYVPVRERRALFESLSQSGGSLARSSEQLARPVMETPRRAASLHDLQAPPTRSVSDLRQFFEAVARGVGACSGLYTPNPVPRFTSLACA-
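
The transcript and protein entries in this record can be checked against each other: 406 residues plus one stop codon correspond to (406 + 1) x 3 = 1221 bp, and the protein sequence writes the stop as "-". A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) applies to this coding sequence. The function name `translate` and the `stop_symbol` parameter are illustrative and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# Assumption: this table is the right one for the nuclear gene in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one codon at a time.

    Stop codons are written as `stop_symbol` ("-" matches this record's
    protein sequence). Ambiguous bases such as N are not handled here and
    raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First 30 nucleotides of DPOGS200686-TA
print(translate("ATGCTTGTGGGGGAGCAATCGCCGGCACCG"))  # MLVGEQSPAP

# Last 18 nucleotides of DPOGS200686-TA, ending in the TGA stop codon
print(translate("TCGCTAGCCTGCGCCTGA"))  # SLACA-

# Length check: 406 residues plus one stop codon
print((406 + 1) * 3)  # 1221
```

Passing the full DPOGS200686-TA sequence to `translate` and comparing the result with DPOGS200686-PA extends the same check to the whole record.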