Monarch geneset OGS2.0

DPOGS210489
TranscriptDPOGS210489-TA1317 bp
ProteinDPOGS210489-PA438 aa
Genomic positionDPSCF300186 - 264948-266264
RNAseq coverage191x (Rank: top 48%)
Annotation
HeliconiusHMEL0163442e-16372.36% 
BombyxBGIBMGA012580-TA3e-7366.98% 
DrosophilaCG10283-PD8e-1644.68% 
EBI UniRef50UniRef50_D6WGU92e-2634.92%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WGU9_TRICA
NCBI RefSeqXP_967502.14e-2734.92%PREDICTED: similar to GH02263p [Tribolium castaneum]
NCBI nr blastpgi|910797767e-2634.92%PREDICTED: similar to GH02263p [Tribolium castaneum]
NCBI nr blastxgi|910797768e-3327.99%PREDICTED: similar to GH02263p [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL17208 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210489-TA
ATGTCGCGATTCAAACCGACGGATACAATGCCGAAAATGGGCCATGTGACTGATCTTCGAGAGCAGTGGCTCGTGAGAACGGACGGAGCGTTGGCCCATCATCTTCAAGACCGCGAGATCTCTTCACATCTCTGTGAAAACAGGTTCCGTAACCAGCAAATCAGAGAGGATTTTCCTTTGGCTTTAAACGAACAGAGGGATATTGAATATGAGTTACGTTTACGGGAAATGGCTCGCCGACGCCAGGACGATATAGACGCAGAGATAGCCCGGAAAATTGCTGATATTAACATTCATAAACAACATCGTCATCTTCCTAACCATAATCCAGTTCAAGCTCCTCAACCCAGTTCATCAAGAGCCACTTTTTCCTCCTCACAAATCGAACTAGAAGCTCCTCCTAGTTTGGCTGTTGCCATCGATCCTAAAGAGTTAGGTCTTCCACCCAATGAGATTAAAGAAATTTTAAGTAGATTAGAACAGGAGGAAAGAGATGCTAAGTTAGCTAGAGAATTAAGTCATGAGTCTCAAGGTCAGGATGCTTTACTAGCTGATATGAAGTGTGCAGTTGAAGCCCAGGACAGTGAATTAGCAAGACTGCTACAGGAAAGGGAATATAAAAAGTTACAACGGGCTAAAGAAAAAGCTAGACAAAAAGCTCTATTAAAAAAACAGCAGCGTCTAGCTCAACAGGATCCACTGCCGGAACCTCCACAACCAACATTGTGTGATGCATACAGTAACCCTCGAGATATCATTAGACAGAACGGTTTGCGGCTTTGTAAATCTGTTGATTCAGATTTAAATTATGTTGAACCATTTGAAAAAGAAAACATACAATCAAAGGAAAGTGCCCTTTTAGCCAAAGAAATGTCAAAACAAATGTTTGCCATTGGAGCTGGAACTTCAAACACAAACCTACATGTGTCCCACAAGCACACAAAGTCCTTTGAAACTGAAAAACCGAGCTCATCAAGCTCACTGAACTACAGGCAGCCGCCACCCCCACCCACTGGTGGGAAAAAGCCTCGATTCCCAGACCCTATGAGTATCAGACCAGCATCACACTACACTCCTGACAATGTGTCATCTGGCTCAGACGGCCAAAACACATTTATGTCACATAGCGTCAGTGAAAACAACAATTTATACAGTTCCACATATCCACCTGATAATATAAATACATCACAACTCGGTGAGAGAGCGGATCCTTCGAAGAGGCTGTCGGTAATATCAGACATTACGGCCGAAGGTTTGGCTAAGAAGAAGAGCAAGCAGTCTGAGAACAAAAAGAAAAAGGGCTGCAAAATTCAATAG

Protein sequence:

>DPOGS210489-PA
MSRFKPTDTMPKMGHVTDLREQWLVRTDGALAHHLQDREISSHLCENRFRNQQIREDFPLALNEQRDIEYELRLREMARRRQDDIDAEIARKIADINIHKQHRHLPNHNPVQAPQPSSSRATFSSSQIELEAPPSLAVAIDPKELGLPPNEIKEILSRLEQEERDAKLARELSHESQGQDALLADMKCAVEAQDSELARLLQEREYKKLQRAKEKARQKALLKKQQRLAQQDPLPEPPQPTLCDAYSNPRDIIRQNGLRLCKSVDSDLNYVEPFEKENIQSKESALLAKEMSKQMFAIGAGTSNTNLHVSHKHTKSFETEKPSSSSSLNYRQPPPPPTGGKKPRFPDPMSIRPASHYTPDNVSSGSDGQNTFMSHSVSENNNLYSSTYPPDNINTSQLGERADPSKRLSVISDITAEGLAKKKSKQSENKKKKGCKIQ-