Monarch geneset OGS2.0

DPOGS211993
TranscriptDPOGS211993-TA4443 bp
ProteinDPOGS211993-PA1480 aa
Genomic positionDPSCF300369 + 67979-98241
RNAseq coverage1762x (Rank: top 7%)
Annotation
HeliconiusHMEL0149850.071.45% 
BombyxBGIBMGA000117-TA0.063.17% 
Drosophila% 
EBI UniRef50UniRef50_D6WI873e-5732.56%Putative uncharacterized protein n=12 Tax=Eukaryota RepID=D6WI87_TRICA
NCBI RefSeqXP_970717.21e-6332.52%PREDICTED: similar to T19B10.5 [Tribolium castaneum]
NCBI nr blastpgi|1892354053e-6232.52%PREDICTED: similar to T19B10.5 [Tribolium castaneum]
NCBI nr blastxgi|1571350543e-7125.35%hypothetical protein AaeL_AAEL013228 [Aedes aegypti]
Group
Gene OntologyGO:00055152e-11protein binding
KEGG pathway 
InterPro domain[112-224] IPR0014782e-11PDZ/DHR/GLGF
Orthology groupMCL26765 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211993-TA
ATGGGTGTGATATTGAAATTATTTATTGTGACGGTTATTCTGCCCTGTGTTTTTGGGCAAGAACAAGAATGTTTTGGTGCTGGAAGCGTGGCCGGGGCAGCGATCGGTGGCTTTATTGCAGCACTTATTTTAATAGTAGCGGCGTATTACTTGAGGAAATTGTATTGTAAATCCCGTGAAGGGAAGCATATAATCCTGTCCAAAGACCCCGAGTCTGTGAAGGATGAATTCGCCTTCGACAACCCTGGCTTCAAAGAGGGCTGGCAGCATGAGGCACCAACCCTACCACTCGGAGGGAACATGACCTCCCAAATAAAAGCTGACTTCAACAACAAACCCGAGTTCAGCAAAGATGATTCCCATTTACAGTTAACCAAGATACGTCGAGTTAAATTGTGGGCTCGCGACTTCACGGGCTTAGGTTTAACGTGTGGGGGTGGCGCTAGAGATGGTGTTTGTATCCATTCTGTGATGAGAAGTGGCCCCGCTGCAGCCGCTATGCTGCAACCTGGTGATAAGATAAAGAGCATCAAAATAGAGTTCAATGGCACGCCGCTTGAGGATGCTGTAGCGATATTATCTCTGGCGTCTCCATATCCTGTCGAGTTGGAGGTGATGGAAGGTGGCAGAGTCAGTGGTGAGAGCTGGCCCGTTTACCATCCGCTGATGAAGGCTGGATCCACTGGCGATGTGAGCACGCTGGAAAAGGCGGGAAAACTCCTGCAACCACCGAAATCACCGAACGTATCGAACTCAAACAATTCAACCCTCGAAACGAAGCATTCGAAATCAGGAATAAAAAAGATAATCACAGAAAAGATAACGACTATAGAAAGAAATAAAAAGGAAAGAAAGGATGGACCCAGCACACTAGAGAGAGAAAACGAAAAAAATCTGAAACTCGCGAACAAAACCAGACATTCAGATGCGCCAACATCGAAACCTGAGAGGAGTAAAAATAGACTATCCTCAGGAAGTGACGTACAAATAATAGTCCCTGACAATAACTCGCACATAGAACAAGCGGAAGTACAGAAGAGAGAATACGATCCGAAGAGAGGAATGAAATTCGGTATAAGAGTCCTCCCACCGAACGTCCCCGACGATGGAGTGTTGAAGAAGAGCGTGGAGAATGGAGCTGTCACCGCCGAAAAGGCCATCGACGAACCAGACAAACCTGAACCACAAAGACAAGCACCCGTACATATAAAACACGAGAACGAAACGGATCAAAAGAAACCGGTTGTCGCTAAACGGAGAGAGAAGTTAGCGCCGCCTATACCTAACGCTAGATCTAAAGTGGAAATGGAGACTAGCCATGAAATATCAAAGACGTCAGACACCTCGACATCATCATTCAGTAGAACTGATCTGAACTCCAGCGGCATCAAGAGAGACGAGAACGGCATACCTCAAGAGTTACCTCAGCACATGTTCGACGCAGCCAAAGCAGCCAGGAGCAACAGAAAGAGCTCGTCAGATTTAGTGCAAGATAAGGAACCAAAAAAGGACGAAGCGCCGAAACCTGCAAAAAAATCTAAAGGCAAAGCACCTTCCCCGCCGGAACCGGAAAAGAAAGACAGCACTCTAGAAGACATAAAGAACTTACATGACTTTTTGAACAACGAAAAGCATCACTCAAGTCTACTACACGATACAGCAACTTCAACTAGAACCACGTACAACACATCGACTCCCAAAGCAAACAAAACCAAATCCAGAATGGACGAATCTATTAACTTCTCGCAAGAAGATATAGACGATATCGTCAAGCCGTTCAAGCGAGAGAGCGACTCCTTGAACAACTTCTTCGAGGACTCCAAATCTAATCTGTCATCTAACCAGGACGTACATTCTGTTATGTCGTTGAATCATAGCGACAAAAGCGACAGGGGCTCCACGACCATCGAACTCGACAATAGCGACATAACCATACACAGCTCTCCGCTAAACGAAACCGAACGGTCGGCGTCAGACGCCTCCCCGGTGGAGGATAATGAGCGGAAAGCGACATCTCTCGGTGACTTGTCGCGCTTCGAATTGAAAGCTAAGATTAACAAGCCGTCCGGCGGCACTTTGGAGAGGGCGCAAAGTTTGGATATATCGGTCGACGATGGCGACATCCAAGAGAGTACTCTATCGCCGAAGAAAAGAAAGGCCATGTCGGTGGTCGAATCCACGTTCTTTGATTCAGGCAACGAGGACATCTTACCTGAGATGATAGATCCGGATAAAGGAATTGTTATCAAGCATAAGGAACCGAGATTAAGCTTGAATATAGCGAAGACATCACGATATCTTCCAAACCTATGTAACCAGAGGAATCGGCTGAAAAAAGCTTCAGAATTCGGTAACTTAGAGGACGCTATCGTCAAAGGATCTAACAGTTCTATGGAGTCTGCGAGATATGAATCCCATGAGGCTATTTATGGGAAATCACAGTCTACCGCGCAAAAGACCCAAGAAGAACACGAAACATCAGATCACTTAGCGCGAAGAATAATGGACGAAAACTTGAAAGTTCACTTGAAGCTCGTATCAGAATTCGCCAAATCTACATCCGAGAGCAGTTCCAGTACATTGGATAACAGCCAGGAACAAAATATAACAAAAACTACAACTACATCTCCAATAAAATACGAGGAGAAAATAACTATGAGCTACGACACGAACGTTCCCGACGATATGAAAGTCTCGCGTAGCTCATACGCTAACAGCCTGGAGCGGCCTAAATCCGAAATGATGAAGAAACTTCTGGCCAAAAATCCCATATTTAATGTCAACATCGACCAAACACAATCTGCAAACAAATACGAAGCGAACACAAGCAAAGAGACCCCGGCACTGACGGACTCATTCAAATCATCTCACCACACACTACACCAACCTGACATAGTTAACTTCGATTTAAAAACATCACCGACATCGAAGGTCAGGGATTACGACGAATACATCAGCAACATAAGAGTCGGTTCAAATAACAACAGTTTGAAAATCAACAAACAACAACAGTTTAGCAGAGACTGGTCCGAGTCTAAGAATATAGAACAAGACAACGTCGTCACCATAAAAACCGGCGATGATTTGCCAGAGAAACGCAGCTCATATACCAAGTCCATAGAAATAGGTGAAAGTAATCGCTTAATACCCGATCTGGTGGAAGGAATAAAAAATAAAACAGACGAAAAACAAACCAGGACGTTGTACATGGAACCAGCTAACGTGTCGCTGACCATGACGCAAGAGCCGGTACAAAAGACGGTCACGGTTAATGTGTCGCTTCTGCTTATGCCCTTATTAGTCGTTACACAGAACGTGGAGAAAATTACCACGCAATACATAACGAGCAAAGTCGAGGCGCCGCTACAAGTCGAACAGATCAGCTTCGGAATGATGAGAGGATCTGACATCAACGAAATGGAGTTAGAAGAAGGAAACGTCAAGGATATCGATAGGAATGTTCTAGAAGAGATAAAGAGGAAGAATCCGAACATACATTTCACATCAACCGAGCCGTCTTATACAAGAACTGAGACTATTCTACTGAACACGACCAACATGGACGAAGAACAAGCGAGAGCGTTGATGGAAAAACTGCAAAACGACCCCAACTTTATGGCACAGAAATCTACAGAGGAATTATCACGAATGGGAATCCGAGTTTTTCATGACTTAGACGACAAATCAGATATGAACAAAGAAACAATGGAAGTCACAAAAACTAGATACGCCATAAATCCATCGACTATAATAAGCGAGACAAAATGCATCGGAACAACACAGGAAAAGGACAAAGAGACACAAAGAGATCACATAACAGAAATCCAAGTGCGACCAAAAACAGAAAAACAAGAGAACAAAACTGAATACAAAATAACGAACGCGCAAAGAAATCCAACGAAGGCGTACGAAGAACCGCTGCTGTCCTACGAACTGGACATAGAAATGTTGAACGACTTCATAATAAACGAGAGACATCACTCAGCGAAACACCTCGCCGAGCTCAAGAAACGAAACGCACAGAACGAGTCAAAGAAACGACATTCAGATTTCGACCTGCCAAGGAACAGCCACATAAAATTCAGAACAGCCACATACGAATCCCCCAAAGGAACCATCGTCACTAGCACAGATCTAGAAAATAGACGTCTATCACAGCTAGATCAAATGCAATTAAGATCGCCAAGTGAGGTGATACAGCCCCAAAAACCGGTGATATCGGCAAAACCGAGCAGCATACCGGTTAAAGACAAGAAACCGATGGGTTTCGTGTCATCAAAGATACCGGTGTTCGGTACCCAAAAATCGCTAAGCCAGGAGAATTTGACCGAAAAAACTTTCTCTATACCACGGTCCTCTCAGAATTTCAGTAGTAGCTCGGGTAATATTTCTATCACATCCATAAAATCTAGTTCCAAAAGTCCAAGCGGTGGCAGATTATGA

Protein sequence:

>DPOGS211993-PA
MGVILKLFIVTVILPCVFGQEQECFGAGSVAGAAIGGFIAALILIVAAYYLRKLYCKSREGKHIILSKDPESVKDEFAFDNPGFKEGWQHEAPTLPLGGNMTSQIKADFNNKPEFSKDDSHLQLTKIRRVKLWARDFTGLGLTCGGGARDGVCIHSVMRSGPAAAAMLQPGDKIKSIKIEFNGTPLEDAVAILSLASPYPVELEVMEGGRVSGESWPVYHPLMKAGSTGDVSTLEKAGKLLQPPKSPNVSNSNNSTLETKHSKSGIKKIITEKITTIERNKKERKDGPSTLERENEKNLKLANKTRHSDAPTSKPERSKNRLSSGSDVQIIVPDNNSHIEQAEVQKREYDPKRGMKFGIRVLPPNVPDDGVLKKSVENGAVTAEKAIDEPDKPEPQRQAPVHIKHENETDQKKPVVAKRREKLAPPIPNARSKVEMETSHEISKTSDTSTSSFSRTDLNSSGIKRDENGIPQELPQHMFDAAKAARSNRKSSSDLVQDKEPKKDEAPKPAKKSKGKAPSPPEPEKKDSTLEDIKNLHDFLNNEKHHSSLLHDTATSTRTTYNTSTPKANKTKSRMDESINFSQEDIDDIVKPFKRESDSLNNFFEDSKSNLSSNQDVHSVMSLNHSDKSDRGSTTIELDNSDITIHSSPLNETERSASDASPVEDNERKATSLGDLSRFELKAKINKPSGGTLERAQSLDISVDDGDIQESTLSPKKRKAMSVVESTFFDSGNEDILPEMIDPDKGIVIKHKEPRLSLNIAKTSRYLPNLCNQRNRLKKASEFGNLEDAIVKGSNSSMESARYESHEAIYGKSQSTAQKTQEEHETSDHLARRIMDENLKVHLKLVSEFAKSTSESSSSTLDNSQEQNITKTTTTSPIKYEEKITMSYDTNVPDDMKVSRSSYANSLERPKSEMMKKLLAKNPIFNVNIDQTQSANKYEANTSKETPALTDSFKSSHHTLHQPDIVNFDLKTSPTSKVRDYDEYISNIRVGSNNNSLKINKQQQFSRDWSESKNIEQDNVVTIKTGDDLPEKRSSYTKSIEIGESNRLIPDLVEGIKNKTDEKQTRTLYMEPANVSLTMTQEPVQKTVTVNVSLLLMPLLVVTQNVEKITTQYITSKVEAPLQVEQISFGMMRGSDINEMELEEGNVKDIDRNVLEEIKRKNPNIHFTSTEPSYTRTETILLNTTNMDEEQARALMEKLQNDPNFMAQKSTEELSRMGIRVFHDLDDKSDMNKETMEVTKTRYAINPSTIISETKCIGTTQEKDKETQRDHITEIQVRPKTEKQENKTEYKITNAQRNPTKAYEEPLLSYELDIEMLNDFIINERHHSAKHLAELKKRNAQNESKKRHSDFDLPRNSHIKFRTATYESPKGTIVTSTDLENRRLSQLDQMQLRSPSEVIQPQKPVISAKPSSIPVKDKKPMGFVSSKIPVFGTQKSLSQENLTEKTFSIPRSSQNFSSSSGNISITSIKSSSKSPSGGRL-