Monarch geneset OGS2.0

DPOGS206802
Transcript: DPOGS206802-TA (3639 bp)
Protein: DPOGS206802-PA (1212 aa)
Genomic position: DPSCF300001 - 4232974-4249632
RNAseq coverage: 561x (Rank: top 23%)
Annotation
Heliconius       HMEL015794         0.0   74.40%
Bombyx           BGIBMGA000603-TA   0.0   70.12%
Drosophila       syd-PA             0.0   51.91%
EBI UniRef50     UniRef50_D6WEI2    0.0   54.99%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WEI2_TRICA
NCBI RefSeq      XP_973389.2        0.0   53.81%   PREDICTED: similar to jnk/sapk-associated protein [Tribolium castaneum]
NCBI nr blastp   gi|189235038       0.0   53.81%   PREDICTED: similar to jnk/sapk-associated protein [Tribolium castaneum]
NCBI nr blastx   gi|189235038       0.0   53.90%   PREDICTED: similar to jnk/sapk-associated protein [Tribolium castaneum]
Group
Gene Ontology     GO:0005515              3e-06     protein binding
KEGG pathway      xtr:100170186           0.0
                  K04436 (MAPK8IP3, JIP3) maps-> MAPK signaling pathway
InterPro domain   [32-187]    IPR019143   1.3e-60   JNK/Rab-associated protein-1, N-terminal
                  [654-1073]  IPR011047   2.3e-08   Quinonprotein alcohol dehydrogenase-like
                  [857-1009]  IPR015943   3e-06     WD40/YVTN repeat-like-containing domain
Orthology group   MCL11323                Multiple-copy universal gene

Nucleotide sequence:

>DPOGS206802-TA
ATGAATTTCGACGACAGTGCGAGTAGTGGCTATGCCACGGCCGAAACCATATACGGGACCCACGAAGATAGTCATGTTGTTATGTCGGAGAAGGTTCAATCTCTCGCGGGGAGTATTTATCAAGAGTTTGAGAAGATGATAGCTCGTTATGACGAGGATGTGGTGAAGACTTTGATGCCACTATTGGTGAATGTGCTGGAGTGCCTGGACTCTGCATACCAAACCAACCAAGAACATGAAGTGGAACTAGAACTCCTTCGTGAAGACAACGAACAACTTGTCACGCAATACGAACGAGAAAAATCAGCCAGGAAACATTCTGAGCAAAAGCTACTAGAAGCTGAAGATCATTATGAAGGTGAAAGGAAAGATTTGACAGGACGTCTCGAGGCTTTGGACAGCATTGTTCGTATGCTGGAGTTGAAGCATAAAAATTCCCTGGACCATGCTAGTAGACTTGAAGAAAGAGAGAATGAACTTAAAAAGGAATATGCAAAACTCCATGAGCGCTACACTGAACTGTTCAAGACTCATATGGATTATATGGAGCGGACTAAGTTACTGCTGAGTACGGCCGGAGACAGGAGCGAGGTCCGAGGCAGGTTGGGGCTCAACCCTGTGGCCAGGTCCAGTGGTCCTATATCATTCGGGTTCGCGTCCTTAGAGGGCGCTGTTGCCGAGCAAACTGAGAGCATCCCCGCCAGCCCTGACAGCTTGTCCTCCTCGCCACCAACACCACCGCTCGGTTCCGAATTAGCAGCGGTACGTACAGTGGAGCGAGCTCATCAGACGGACCCTATTACACAACATAGTCAAGCAACGTCCCCGGTAACTCCTCAAGCACCACCTGGTCGTTCGCAGACCAAGAGTGAAAAACGAAGCGGGAACACCCTGTATCAAGAGCTGGCGTTCCACGACACAGACGGTGGTGTGGAGGGGGACGATGGCGCGGACATTACTGGCAGCTGGGTCCACCCAGGGGAATACGCGTCCTCAGGTATGGGGAAGGAAGTTGAGAATCTAATTTTGGAGAATAATGAACTTCTGGCGACCAAGAACGCCCTGAATGTGGTGAAGGATGACCTCATAGTGAAGGTGGACGAGCTTACCGGCGAGCAGGAGATCCTGAGAGAGGAGGTGGCAGCCCTGAGCTCGGCGCGGGAGAGACTCAGGGATAGGGTCTCGCACCTAGAAGAAGAACTCAGACATCTCAAGGAGACGGTGATAACCAGCGCGGGGAGCGGAGCGGGTGCGGAAGCGGAAGAGGAAGCGGACGTACCAATGGCGCAACGTCGCAAATTCACTCGTGTGGAGATGGCAAGAGTTCTCATGGAGAGGAACCAGTATAAGGAGCGCTTCATGGAGCTTCAAGACGCTGTACGTTGGACGGAGATGGTGCGAGCGAGTAGGGCCGACTCATCTATGGATAAGAAAAGCAAACAGAGCATTTGGAAGTTTTTTAGCAATCTCTTCAGTTCTCCGGAGCGACCCCAGCGTCCCCTATCGACGCCACACCTGCCCGCGGTCGCCGCCCCTACCGCCTCCATGAGACACTCACACACAGAGAACTTTCTAGAGACGCAACTATCGGATCACGGGCACGCACACACGCGCAGACACGAGCATACCCGCACCGACCAGTTCCGACAGGTGCGTGCTCACGTTCGCAAAGAGGACGGTCGCACACAGGCGTACGGTTGGAGCCTGCCTTGTAAGAACTCACAGGCACATAAAGGAACATCACTATCGCTCTCGTCAGGAGGTGTTCCTGTTCCGGTACCGGTTTTTTGCAGACCTATAGCCGAGTCGGAGCCTCAGATGACTCTACTGTGTGCGGCTGGTGTAGAGAGATCAGTTCCGACCGGTCGAACGGACAGACATTCAGATGATTATGGAGAGGATCTCAGTCTAGCTCACGAGATGAACAATGCCCAGTGTGACGATGGAACAAGGAAGCTCTCCTCTCTCGTGTGGATCTGTTCCAGCACACAGAACAAGTG
TATCGTTATGATTATAGACGCAAATAATCCTGGCGAGGTGGTGGAGTCCTTCCCCGTGCCAGATAAGCACATCTTGTGCGTGGCGTCGGTCCCTGGTGCAAGTACCGCGGATTACCTCGAGGCGAAAGGAAGGGAGGCTGCTGTAACCAACGTTGTACAACCAGAGGAGAATAAACGTCATTCGCAATCACTATTCTGCGTGGGCAAGAGTGATTCTACAGAGAGTCAGAGCAATGGTACGGTGACATCAAGCAGCAACGAGAACGGTGATAAAGAGGTTATTGTTGGGAGTACGAGATTGGTTGAGGCGAAGCTTCTAACACCGCCCAGTACTCCAGTGAAGGAGACTCCAAAGAAGGAGAGTGAAGTCAAGGAAGGGAACGCGGAAGTTGATACTGAACCAGCTGACAGCCCGCCACTGTCATCAACACCAGTCAATGCGAACGAAGCTAGAAACTCTCCTGAACCAGGTGTTGACGCGTTCGCCCAGCAGCAGAGCTGTGATACAGCTGGAGCTAGTGCTGGAAGCGAAATCAGCACCGCAATGAATACCATGTGGCTGGGCACTAAGAGCGGCAACCTATACGTGTACTCCTCCGTGAAGAATTACAGCAAATGTCTGGCAATGGTGAAGTTAAACGACGCTATCCTGTCTATCGTATGGTGTTCGTCTCGATGTGTGGCCGCTCTTGCTGACGGTACTGTGGCTGTGTTCGCGAGAAAGACGGATTTAATGTGGGACTTGACAAGCTACTGGCTACTCACATTGGGGGATCCGAAGTGTTCTGTGCGATGTCTGAGCGCGGTTGGACCTCCCGCGTCTGGTACCGTCTGGTGCGGGTACCGTAACAGAGTGTTGGTGGTAGAGCCGCGCTCACGACGGGTACTACACTCACTGGAGGCGCATCCGCGACACGAGAGCCAAGTGCGACAACTCGCGGCCTACAGGGACGGTGTGTGGGTATCAATTAAAATGGATTCAGCGTTACGTTTATACCACGCCACTACATACGATCATCTCATGGATGTGGACATCGAGCCTTACGTCAGCAAGATGCTGGGTACCGGCAAGCTTGGCTTCTCCCTAGTACGAATAACAGTGCTGCTGATTAGTTCCGGACGTCTGTGGATCGGCACCAGCAACGGCGTAGTGATCAGTGTGCCGCTCTCAGACGCACCAATACCATCCGGGAACACGGCACTGACAACCACGTCACGGTCTTCCGTACCAGGTCACGCGGCGTTGATCCGAGCTGGTGTGAACGTGCCGCCTGGAAGTTGTATACCGCTCTGTTCGATGGCCCAAGCTCAGCTCAGTTTCCACGGACATCGCGACGCAGTAACGTTCTTTGTGGCGGTGCCGGGATCAACGACGCCACGGTCGCCGAGTACTCCTCCGGCCGGAGCGCCCGCCACGCCGCCCTCACCGACACCAGCGTCACCCGCCACGTCGCCGCCGCCTCCCCCACCACCGCCGATGCTAGTCATCTCAGGCGGGGAGGGATACATCGACTTTAGAATAGCGGATACCGGTATGGAGGAGAGCGTGGTGGTGGCAGAGGACGGCTCCAGCGGTCACAGTTCTCTGGACCGCGGCGCTCGCTCGCACTTGATTGTGTGGCAAGTGGCGGCTGCCTAG

Protein sequence:

>DPOGS206802-PA
MNFDDSASSGYATAETIYGTHEDSHVVMSEKVQSLAGSIYQEFEKMIARYDEDVVKTLMPLLVNVLECLDSAYQTNQEHEVELELLREDNEQLVTQYEREKSARKHSEQKLLEAEDHYEGERKDLTGRLEALDSIVRMLELKHKNSLDHASRLEERENELKKEYAKLHERYTELFKTHMDYMERTKLLLSTAGDRSEVRGRLGLNPVARSSGPISFGFASLEGAVAEQTESIPASPDSLSSSPPTPPLGSELAAVRTVERAHQTDPITQHSQATSPVTPQAPPGRSQTKSEKRSGNTLYQELAFHDTDGGVEGDDGADITGSWVHPGEYASSGMGKEVENLILENNELLATKNALNVVKDDLIVKVDELTGEQEILREEVAALSSARERLRDRVSHLEEELRHLKETVITSAGSGAGAEAEEEADVPMAQRRKFTRVEMARVLMERNQYKERFMELQDAVRWTEMVRASRADSSMDKKSKQSIWKFFSNLFSSPERPQRPLSTPHLPAVAAPTASMRHSHTENFLETQLSDHGHAHTRRHEHTRTDQFRQVRAHVRKEDGRTQAYGWSLPCKNSQAHKGTSLSLSSGGVPVPVPVFCRPIAESEPQMTLLCAAGVERSVPTGRTDRHSDDYGEDLSLAHEMNNAQCDDGTRKLSSLVWICSSTQNKCIVMIIDANNPGEVVESFPVPDKHILCVASVPGASTADYLEAKGREAAVTNVVQPEENKRHSQSLFCVGKSDSTESQSNGTVTSSSNENGDKEVIVGSTRLVEAKLLTPPSTPVKETPKKESEVKEGNAEVDTEPADSPPLSSTPVNANEARNSPEPGVDAFAQQQSCDTAGASAGSEISTAMNTMWLGTKSGNLYVYSSVKNYSKCLAMVKLNDAILSIVWCSSRCVAALADGTVAVFARKTDLMWDLTSYWLLTLGDPKCSVRCLSAVGPPASGTVWCGYRNRVLVVEPRSRRVLHSLEAHPRHESQVRQLAAYRDGVWVSIKMDSALRLYHATTYDHLMDVDIEPYVSKMLGTGKLGFSLVRITVLLISSGRLWIGTSNGVVISVPLSDAPIPSGNTALTTTSRSSVPGHAALIRAGVNVPPGSCIPLCSMAQAQLSFHGHRDAVTFFVAVPGSTTPRSPSTPPAGAPATPPSPTPASPATSPPPPPPPPMLVISGGEGYIDFRIADTGMEESVVVAEDGSSGHSSLDRGARSHLIVWQVAAA-
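
The transcript and protein lengths in this record (3639 bp and 1212 aa) fit a coding sequence of 1212 codons plus one stop codon, and the protein sequence appears to mark that stop with a trailing "-". A minimal Python sketch of the check follows. It assumes the standard genetic code; the `translate` function and the `stop_symbol` parameter are illustrative names, not part of any database tooling. Only the first and last codons of DPOGS206802-TA are used here, copied from the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Whitespace (for example FASTA line breaks) is removed first.
    Stop codons are written as `stop_symbol`; "-" is an assumption
    chosen to match the trailing character of the protein record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First ten and last six codons of DPOGS206802-TA.
first_codons = "ATGAATTTCGACGACAGTGCGAGTAGTGGC"
last_codons = "CAAGTGGCGGCTGCCTAG"

print(translate(first_codons))  # MNFDDSASSG
print(translate(last_codons))   # QVAAA-

# Length check: 1212 residues plus one stop codon, three bases each.
transcript_bp = 3639
protein_aa = 1212
assert transcript_bp == 3 * (protein_aa + 1)
```

Passing the full nucleotide sequence to `translate` in the same way would be expected to reproduce the DPOGS206802-PA record, since the function drops line breaks before splitting the sequence into codons.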