Monarch geneset OGS2.0

DPOGS203655
Transcript: DPOGS203655-TA, 1512 bp
Protein: DPOGS203655-PA, 503 aa
Genomic position: DPSCF300010 - 2764117-2769213
RNAseq coverage: 653x (Rank: top 20%)
Annotation
Heliconius        HMEL006940         6e-102   62.91%
Bombyx            BGIBMGA003452-TA   2e-80    60.67%
Drosophila        CG17377-PC         4e-10    58.00%
EBI UniRef50      UniRef50_D6X0N3    3e-13    47.06%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X0N3_TRICA
NCBI RefSeq       XP_968512.2        1e-10    54.24%   PREDICTED: similar to CG17377 CG17377-PA [Tribolium castaneum]
NCBI nr blastp    gi|270013556       1e-12    47.06%   hypothetical protein TcasGA2_TC012174 [Tribolium castaneum]
NCBI nr blastx    gi|189235559       2e-24    32.83%   PREDICTED: similar to CG17377 CG17377-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL24904, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203655-TA
ATGAGCGCTCCCTTCTACAGTTTTGACCCATGTGTATGCACCGGAATGCCATGCGGTGCCTGCTATCCATACTCTACGAAGCCGTGTGTGGCCTCGCGAAACGATCTGAGTCCTTGTTCAGTTTGTCCTGGTGGGGGATGTTCTCCATGCCCAGCGCCAGTACGACCATGCCCGGCGCCCTGTATACCACCACCTGTACTACCATGTTTGCCTCCATGCCCTCCGTGTGATAAGCCCGTAAACCGAGTCACACCACCAGTACCTGTATGTGACTCAGTGGCAATACCGTGCTGCAGACCAAATCGCTGCAAGCCTTGTCCTTGTTACTGCTGTACAGAACCTGGGGTTTGTTCTCCGAAGAGATGTAGTTACCCTGGGGTTCCTGTATGTGGAGGTTGCTGTGGCCCTTGTATGCCAGTGTGGTGTTCTCCATGTGGTCCGATGGTATGCTATAAATATGAAGACTGCGGCCCACCATTGCAGTGCCTGCCGCGATCTGCGTTTAAGCGTAATGGACTGCAGGATTGCCAGTTCTTCCAGCAAGGGTCGATCCGATATTTGTGTCGGGCGGGCCCTTGCTGTCCGAGGCCATGTCCACCGATCTGTCCGCCGTGCGTTCGTCGCCCTCCACCTGGAAAGACCTACGTTACTCGCTTCAGCGATTTTGATCTGATCCTGAAGTCCAATGATGCAGCCCCGAGGACGGAGGAGAGTGGTACAGAAAAAAAGGATGAAAAAGAAAATCAACTTAAAAGGACCAAGAGCCGTGAGCTAAGAGGCGGTATTATGTATTACAGTTGCCACTGCATTAAACGAAATGGGCTTCAACATGACTGCAGAAGAACTGGTTGTGGAGGGGAGCCTGCATGTCTTGTGCTGCCAGATCCTCTTTGTGCGCCTAGTCAACTTGCTCGTAATCGGGGACTTGCAGATCCCGCACCTTTAGCAGCTCATTATCGCGCCCAAGGTGGAGGGTCAGCAGGGCCAGCAGAATCAAGTGCAGGTTCTGGCGGTGGCAAAAGATTTATTGTTTGTGAATTGAAGGGAATCGTTCCTGATGTGTCTACAAAGGCTGAGCAATGTTGTAAGTGTCCACCTTTTAACACCAAAGTTAATGATTTAAACTTTACAAATCAGTCTGCAAACTGTGGATGTGATTCCAATAACAACCCTACCAATTCATTATTTGTTTTCGATGAAAAAACAATTGAAACTATTTTAAGCCGCTTTCAAAATAAAGAATTAGCACCTAGTAATATGGTCCGTGGACCGGGCGGACGATTACGTGAGGTTAAAGCTGTAGTTAAGGAAGAGCCTTGTACAAGACCAGGTTGCGTACATTGGAAACCACCTCCACCTTGCTTATGGGATGCCCCTTGCAAAGCAGATTGCTTTGAGACTCCGCCTGGTATTCAGCCTGGTCGAAATCCTTTAAGTAGCAGTGGCGGCTCTAAAATACCCAATCTAGTAGCTCCACATAAGAAAATGGTAGTTCTTTCACCAATACAATAA

Protein sequence:

>DPOGS203655-PA
MSAPFYSFDPCVCTGMPCGACYPYSTKPCVASRNDLSPCSVCPGGGCSPCPAPVRPCPAPCIPPPVLPCLPPCPPCDKPVNRVTPPVPVCDSVAIPCCRPNRCKPCPCYCCTEPGVCSPKRCSYPGVPVCGGCCGPCMPVWCSPCGPMVCYKYEDCGPPLQCLPRSAFKRNGLQDCQFFQQGSIRYLCRAGPCCPRPCPPICPPCVRRPPPGKTYVTRFSDFDLILKSNDAAPRTEESGTEKKDEKENQLKRTKSRELRGGIMYYSCHCIKRNGLQHDCRRTGCGGEPACLVLPDPLCAPSQLARNRGLADPAPLAAHYRAQGGGSAGPAESSAGSGGGKRFIVCELKGIVPDVSTKAEQCCKCPPFNTKVNDLNFTNQSANCGCDSNNNPTNSLFVFDEKTIETILSRFQNKELAPSNMVRGPGGRLREVKAVVKEEPCTRPGCVHWKPPPPCLWDAPCKADCFETPPGIQPGRNPLSSSGGSKIPNLVAPHKKMVVLSPIQ-