Monarch geneset OGS2.0

DPOGS209393
TranscriptDPOGS209393-TA1845 bp
ProteinDPOGS209393-PA614 aa
Genomic positionDPSCF300118 + 502963-504866
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0070145e-0628.69% 
BombyxBGIBMGA014091-TA2e-0726.34% 
Drosophilapst-PF2e-2530.14% 
EBI UniRef50UniRef50_B0W1461e-5433.52%Putative uncharacterized protein n=4 Tax=Culicidae RepID=B0W146_CULQU
NCBI RefSeqXP_001657022.13e-5734.36%hypothetical protein AaeL_AAEL013784 [Aedes aegypti]
NCBI nr blastpgi|1571373286e-5634.36%hypothetical protein AaeL_AAEL013784 [Aedes aegypti]
NCBI nr blastxgi|1583009384e-5434.26%AGAP011771-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
Orthology groupMCL19883 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209393-TA
ATGGAAATTAGCGAACGTGAGTGGAAAAGTAAAGCTAGAACCAACATTACTAAATATGTACACAGAGGTGGAGAATGTAAGGGAGACGAACCGCAGCTCATTGGAGGTAGTGAGTTTTTGAACACGATAGGTATTGTTATGGGCCTGCCGACCAGCGACCCTAACAAGTGGACAAAGTATGAAAGAGAAAAAGCATCAAGAGGAGAACTCGTTTTTTACGAAGGGAAATCTCGTGAAGGAGGAGATGACAAAAATTTTATTACCGTTCTCCCAATTGAACTCTATTATGAAGGAAAGTTATACGAATTACCACTCTTTAGAATCAAACGCTACAAAAACTCAAAGATCTATTACATTGATAACGTTGGACGTTGTTACGGTAGTTATGATAACTGGTACAATTTTAACAAATTGCCACCTGGGGAAATGGCTTATCCTTCGAAACTACAACTGTCTTTGAACCCCAACACCAAAGAGGCGTATGTCGTGTTTTCCGACACACCGACCTCTAGACTAGACGCCAAGGCCGCTCGCGGTCTGGACACCGTGGCGGCGGTGGCCGGGCTCACATCCTCCGCTGCGCTCCTGTTCGTATCGGGCGGACTCGCCGCTCCTCTTGTAGTGACCTCGCTGGTCACCGCCGGGTGGGGCACCACGCGGGCTGGCTATCAGATAGCGGAAAGAGCTGCGCACGGGGAAAGCGTCAACCCATTGACTAACTCAGATTCTCGGATGTTATGGCTGGGCGTGGCCGCCAGTCTCACCAGCTTCGGGGCGATGGGAGCCTCCATGAGACTGTCGTCTTTGGCGGCCCGTGGACGAGAAATATCTAACGCTTTCAAGATGTTCACCGACATCACTAACGGGGCGAATGTCGCCATTAGCGGGTTAGCGATTATAAACACTTCTATTGTCATGTATCAACATCGGGATGATTTAACGGCTGTAGATGTTCTGATGTACAGCGCCTCTGTAGCCTTTTGGTCGAAAGGGGTGTATTCTTACAAATCGGCGAATACAATAATTAAAGAGTTTCAAAACTATGCTTTCGCTCATATCAATAAACAATTGAACCAAGATCAAGCTTCTGAAATGAACCAAGTCCGTAGTAGGTTCGGCGATGATCCCGCCTTACTGAGGAGATTTGCCGCGGCCATGGAAAAAATATTAGCTCTCAATGACGATCAGATCAGAGCGTTCGATAGTTTCCGGAACCATCTCCGGGATGATATTAAATTAATAAGCGGGATAAGAAAAATATCTGAACATTACAACATAAATCCAGGAGAAACCATCGAAACCATTATCAATCTGTGGCAGGGGTCCGGTGGTTCGTCTCAGATAGTACCGATAGCTTCCGACACGATGCTCAAGGATGGAACTTTGATATTAGGAAGAGCTCCGCCTATTAAAATAACAGACCTGCCCCAACTCTCGCCTCCGATGATCCGCTTCCTCGGCGATCTGAGCCAGATCGACGTAGCGGGCAGCGCCCAGTGGTCGACGTCGGTGCCTATCTTACTGACCCTCCAGAACCGCGGGTTGTTTACGGTTTGCCCCGTCGTAAGAATCATTTCCGGCGGTAACGCTGTCGTGTCTCTCAACAACGTCCTGAATATCAGTATCCATAAATTGTACTCCATACCTAATGATGATTGCAAAACGATGCTTCATTTGATTGGTAACATGTCACCTACAGTTTCCAGTAACCAAATCACCAAAGAAGTGAAAACCCTGTGTGTAGTAAAATATAGATTGCGGTTCGAATGTCAGAGAAACGAAAGCGTCTCTTCGATCGCTAAAATTTTAAAAAGTCACAAGACATTGCACGATTTGGTCGAATAA

Protein sequence:

>DPOGS209393-PA
MEISEREWKSKARTNITKYVHRGGECKGDEPQLIGGSEFLNTIGIVMGLPTSDPNKWTKYEREKASRGELVFYEGKSREGGDDKNFITVLPIELYYEGKLYELPLFRIKRYKNSKIYYIDNVGRCYGSYDNWYNFNKLPPGEMAYPSKLQLSLNPNTKEAYVVFSDTPTSRLDAKAARGLDTVAAVAGLTSSAALLFVSGGLAAPLVVTSLVTAGWGTTRAGYQIAERAAHGESVNPLTNSDSRMLWLGVAASLTSFGAMGASMRLSSLAARGREISNAFKMFTDITNGANVAISGLAIINTSIVMYQHRDDLTAVDVLMYSASVAFWSKGVYSYKSANTIIKEFQNYAFAHINKQLNQDQASEMNQVRSRFGDDPALLRRFAAAMEKILALNDDQIRAFDSFRNHLRDDIKLISGIRKISEHYNINPGETIETIINLWQGSGGSSQIVPIASDTMLKDGTLILGRAPPIKITDLPQLSPPMIRFLGDLSQIDVAGSAQWSTSVPILLTLQNRGLFTVCPVVRIISGGNAVVSLNNVLNISIHKLYSIPNDDCKTMLHLIGNMSPTVSSNQITKEVKTLCVVKYRLRFECQRNESVSSIAKILKSHKTLHDLVE-