Monarch geneset OGS2.0

DPOGS205929
TranscriptDPOGS205929-TA1683 bp
ProteinDPOGS205929-PA560 aa
Genomic positionDPSCF300156 - 293815-296577
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0064480.069.20% 
BombyxBGIBMGA002861-TA0.066.32% 
Drosophilasprt-PA4e-0830.28% 
EBI UniRef50UniRef50_D6WZN45e-4233.20%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WZN4_TRICA
NCBI RefSeqXP_973549.19e-4332.95%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910908982e-4132.95%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|2700132277e-4634.69%hypothetical protein TcasGA2_TC011803 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL26621 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205929-TA
ATGAAAAAAGCTCGTCCCAGGTCAGTCGTGGTATGTTTTAATATGTCTTTATATCACAGTACGGTCCGTGACTGCGGCGGCAGTGCGGTGTTGGTGTCAGCGGAGGCCCGGGACAACACATGGAGTGCCGCCATCCGGGGAGCCGCAGTCGCCGTGGAGTGGAGCGACCACCCTGCCCTCAAGCAAGGCGACCGAGTTCTGGAAGTAAACGGCACGTCGACACTGTGGTGCGGGTCCGAGGAGCTCCGGCGCGCCCTGAGCGCAGCCTCGCCCGCACGCCTCGTGCTGCTCCGAGCGCCGCCGCCCGCACTCACACAGAGCGAGGCGGCATCTTTGCGCTCCGAGCTGGGCGCGCTACAGGCGGCGGCGGAACACGCGGAGAGAGCCAAGCAGGGACTGAGGCAGGACAACACCAGGCTCACCCACCGCATCTCATACCTCGAGGAACAGGTCGCCGAGCTACTGGCGCGACACCCGCAGCTCCAACCAGTAAGCAGTGCCGACAGCTGTATCACTGTCAACAAAACGAAGAGAAATGTTACCAACATCAACATCACCTCGGAGCCGGCGACCAAACAGACCCCAAAATCTGAGGTGCAGGTGTTCCAAAAGGGACCCGACATCACGGCCATAGTCGCCAAGTTACCAGGACTGGACGGGGCCGAGTCGAGCAATCTGCCCGTGGTGAGGCCGAGATCTAACGCGTCGGGAGCCTCCTCGAGACTGGCGACCTCTCCCCGGACCTCCCCGGCCCCCAACAGGCGGGCCCACAGCTCACACAGCCTGGAGCACCGCGCGGGCCGGCTGCGACATTCCCTCTCACATCACTGCATCCACGACTACAGCGCGGAGACCGACGCCGCCATCAGAATGATAGAGAGGAACCAGAGACATATGGAGAGGCAGAGGATCGACACCGAGAAATACAATAGAACGAAGCGGGAGGATCGGTTGAGGTCCGACAGCGACCTGGACGTCCAGGGAAAGGAGAACACCAGGAACGACATAAAGAAGGCGGCGGAAAGAATAGAACAGAGCATCAAGAAGACGAACTGGGCCGAGAGGAAGACGCTGTCCATCATAGAGCAGCTGAAGAGATCACAGCGGATGCGCAAAATGAAGAAGAACGAAAGTTCCGAGAACATCCAAGTGGAGAAGGAGCGGCATCACTCCTACAGCAGGATAGACGGGAAGGTGCTGGAGAACGCGCACAAGTCGAGCCGGAGGACGGCCAAGGGGCAGTCCCGGAGCGCCAAGTCGTCGGAGTTCGAGTCCGAGTGCTCCGACTACCCGGGCGGGGACGTGTTCAGCAACTCGCCGCGGCTGGCCGCCACCGACTACGACGAGAACAGTTCCAGGACGACTCGCTACCTCAAGATAGACGACCTCCGGGAAGACTGCAAGTCCAGACCGACGCCCCCCAGGAAGCCGCTGAGACTGTCACTGCACAAGGCGAAAAGCGCCCACTCGCTAGTCAACGGCAGCGAGTCGGAGGCGAGCAGGCCGCCGTCCGAGGCGCAGAACGGAGACGCCGCGTCACACTGCAAGAGGCCCGTGAAGCGCTCGCACGCCGCGGACAAACACACCAGGGAGAGGCTTCGGGACCCGCGAGCCACGCCGAGGCCTGCTCGACGGACCGGGCCCGCCCCGCCCGCCGACAGGCTGTACTCGGACAAGTGGTGA

Protein sequence:

>DPOGS205929-PA
MKKARPRSVVVCFNMSLYHSTVRDCGGSAVLVSAEARDNTWSAAIRGAAVAVEWSDHPALKQGDRVLEVNGTSTLWCGSEELRRALSAASPARLVLLRAPPPALTQSEAASLRSELGALQAAAEHAERAKQGLRQDNTRLTHRISYLEEQVAELLARHPQLQPVSSADSCITVNKTKRNVTNINITSEPATKQTPKSEVQVFQKGPDITAIVAKLPGLDGAESSNLPVVRPRSNASGASSRLATSPRTSPAPNRRAHSSHSLEHRAGRLRHSLSHHCIHDYSAETDAAIRMIERNQRHMERQRIDTEKYNRTKREDRLRSDSDLDVQGKENTRNDIKKAAERIEQSIKKTNWAERKTLSIIEQLKRSQRMRKMKKNESSENIQVEKERHHSYSRIDGKVLENAHKSSRRTAKGQSRSAKSSEFESECSDYPGGDVFSNSPRLAATDYDENSSRTTRYLKIDDLREDCKSRPTPPRKPLRLSLHKAKSAHSLVNGSESEASRPPSEAQNGDAASHCKRPVKRSHAADKHTRERLRDPRATPRPARRTGPAPPADRLYSDKW-