Monarch geneset OGS2.0

DPOGS200478
Transcript: DPOGS200478-TA (3798 bp)
Protein: DPOGS200478-PA (1265 aa)
Genomic position: DPSCF300158 - 281627-339989
RNAseq coverage: 303x (Rank: top 37%)
Annotation
Heliconius  HMEL006035  2e-146  60.86%
Bombyx  BGIBMGA010412-TA  5e-113  52.35%
Drosophila  baz-PB  6e-11  25.09%
EBI UniRef50  UniRef50_E0VDS8  9e-74  40.43%  Multiple pdz domain protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VDS8_PEDHC
NCBI RefSeq  XP_001606745.1  9e-86  36.19%  PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp  gi|345481614  5e-85  36.06%  PREDICTED: hypothetical protein LOC100123134 [Nasonia vitripennis]
NCBI nr blastx  gi|91077544  2e-115  36.42%  PREDICTED: similar to Tyrosine-protein phosphatase non-receptor type 13 (Protein-tyrosine phosphatase 1E) (PTP-E1) (hPTPE1) (PTP-BAS) (Protein-tyrosine phosphatase PTPL1) (Fas-associated protein-tyrosine phosphatase 1) (FAP-1) [Tribolium castaneum]
Group
Gene Ontology  GO:0005515  9.8e-24  protein binding
KEGG pathway
InterPro domain  [586-711]  IPR001478  9.8e-24  PDZ/DHR/GLGF
Orthology group  MCL24787  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200478-TA
ATGGCGGACAAAGAGGACACGAGTGATAGTGATGGCCCAATAATAGATGACAACGTTATCCGAATACAGCTTAAAAAACCTACACATAGGAAATGGGAGCTGCGAACCAGAAACCTTTCACCCGGTCTCAAGTTCCCAACTATACAGCTTTTTGATTCCGAAGGGAATCTATTAGTCGAAACAAACGATGAGAATATATCGATTACAAGCCTAGATCACGATGAGTCCCCTCTTAGAACAAGTCAGTCCTTAAGGGAGAAAGATTACAAGCACATAGTTAGGAAAAAAATTATAAGAAAGCACAACACTATATCAGGTGATAGAATATATACCCGCGTTTATCGCAATCCCGTGCACATAAAAAATAACGATATCTGTTGTAAAAACAACGACTGTTATGTAATCAAAGACGGTAGCTGTATAACAAACATTCCAAGTAAACTTATTGTAAACAACAACTTACATAATTTGCAAGTAAACAGCCTCGACACCACGCCCTTCCATTCAGAAGGACCGAGTGAAGCTAGCAGTTATAAAAGTTCCGACAATGAAACCTGCCACGATTTGCCGACTAATGACAAATTGTCTTCCAGTAACGAAATATTGGATGATATTGATTCTACTCAAAACTCGATAAATACATCGAGTATATCTTTAAACAATATCGATTGCAATAACGATGAGACCGATGTACAAAATAATGTTTGTTTGGGACTTCGGACATCGCAATCATTTGACTCCGCCAAGCTGCCGGTACACCAAAAGGGTTCGCATAATATACACCGCTCAAAATCAAATGCCATTTACAACATACCCCAAAATAAAATTTCCTTTCTTGACACATATTTAAAAAGCTTACCTGCCCGCGCAACTTCCAATCCGGATCAGTTTACTTCACACCTACGCCAAATAACGTTAGAAACTAATTCATGGAAACCGAAGCGAATCGAATCTCAACCATTTTGTGCCCAAAAACTGGAAATATATAAAGTAGACGATAATACTACGGAGAATGATTTTATGAATATATCGACGGACGTGCTCGCCGATTGTGAAATGAAAAAACGTTTAAAAATGTACAGACGAGGTTTATCTGAAATTGAGTCCCGATGTGGAACATCAAATATCTTACCGAGAAAAAGACGCCACACTGTGAGTGGTGGCATGCAAGTCACTCGTTGGTCCTCATCTTCTGAGAGTGTAGACGAAGCCGACGTCGTTCTCAACCGTTTGAAGAGACGGATACTTAAGAACAAGTTAAGAGAGAAACGTCGCAGTTTCGTCTCCAAATCATCTCTTGATAACGACGCATGCAAAGAGAGCGAAAGTGAATCTCAAGACGATTGGCGCGATGCGGGCGGCGGGGCCTTCGTGAGGTCCCGCCTTGCTATGGGCTCAAACATCAGCCAGCACTCCGCTAAGGGTCTGAAGGGCCGTCGGGCACGCTCCAGTGGTGACCTCTGCAATGGCGAAACGAAAAGCCAAACCGAGTCCTCGGTTTGCGGCTCGGTTGTCGGTGACGTTATGGGTTGGAACAGATCGCTTCCTAACCATTTGGACGGGAACAGACATTTAGCGGATTACAGCCGCGTCAATAATAACTACGGTGACTTCGGTACCTATAGGAGTCATCCCAAAGGCGGGAGAGGCTTACGTTATCGCGTGTCAAAATCTGGATCTGACGCAGAGCCAGTGTGGAAGCTCCAAGATCCTGGTTTTGACCAGGGCTACGGTTCAGAAAGATCTCCTGAAGAAGACGTGCAAGCCATCGTCCCGCCGATATCTATCGAACAGTATGAAGCCGAGCTACGAACTGTTTATCCCTTTATTACTGATGAAAACACCTTCACCGTTGTCGTAGAGAAGGATGGGCGCGGGTTGGGTATGTCAGTGTGCGGTGGCGGAGGTCTAGTCAGAATACGCAGACTGTACCCCCCTCAACCAGCCTGGAGGACTGGCCGATTGGCTCCCAAAGACCTACTACTCTCTGCAAATGGTGT
ACCACTTGCCGGACTTAGCACTTACGAGGCACTAGAAGTGCTGCGTACAGCATCAGCCCGAGTGGAACTTCGCGTCTGCCGGCCTCCAGCCGATATGCTCGAGAGCATAACACCCCCCGACCCTCCCACACCGCCCGTAAGGACTCCTCATCCCCCACACCTGCCACTGGACCCACTCAACTGCCATCCCTTACACGCCAGATTATCACAAACAACGAGCAGTGCTACAACATCGTCATCCGAAGGTCGAGGTCGCCGTGACGCCAGTCCTGATACTGAAAGAAGAGTCCAGGATCTACATCTGCCGGATTTGGACCAACATCTACCAGTTTATGATATACAATATGGGGAATTTGATATAGTGATGACAAAAGTGAATGGATCTTTGGGATTTACTCTACGAAAAGAAGATCACAGCGCGCTGGGGCATTACGTGAGAGCGTTAGTTCGAGAGCCAGCTTTGAGTGACGGAAGAATACAGCCAGGGGATAGAATTGTCGCTGTCAACAACACACCGATGTCAAACATGTCTCACTCGGAGGCTGTATCGTTCTTGCGTGCGTGCGGGTCAGAGGTGCGCCTGCGACTGTACCGCGACCACGCTGCGACCCCCCTCTCACCCGTTTCCCCCAAGGACATACTTACAGACTCCGACGCACCGCTGAACAGACCAAAACCTCCATTGAGAATAATGACAACTTATTTCATTGGGCTGAAGCAGTATCGATGCTATGTGACCTGGCACGAGCTTCTGACTGAGGTCTACGGGGTGATGGTTCGCGATCCCCTTGTCTCTCACCTCGCAAACATCGAAGGCTCACCAAAGACAACCACTACGAACAGCCCAGTTTTCCGCACAGATGTATCAAATTTTGCTACGCTAGAGACACACACAAAACGAGAGAGCTGCAAACGTTCAGACACCGAACGTACAGCTGTAGAAAGAGGGTTGGTCTTAGAACGATCTGCATCCACAACAACGGAGATGCCAGTAGATAATAAAACACGCTATGAAGAACCGAGGCGGCGTATAGCACTCGCCAGTCCCACAGCTCCGCCGACTTGTAGAAAACAAAAGCTAAGCCTGACAGGTGAGACTGATTCCACAGTACCCCCTCATGGCTACGAGCTAAATAATTTAGACAACGACCAATTGGACGCACCGAATTTGTACCAGGAGAATATAACACTACAGCGGTTCATTGAACCGGCATTTCCGGTTGATTCAGATGAGCCTGTGTCTATGCCAGCTGAACTGTCTAGTGATGAAAGATTTAAGCATTCAAGTCCTGCTTATCAGAGCGCTGTACTCCATACCACAACTACTGAAAGTACCTCCGATGGAAATAAAGATGACGGTAACGGCTTAAAAAAGTGGAAGGGAGTTGCCTTGTCACCTGACAATGACAGGAAGGCAAAGCCACCTACATCGGAAATTCCCCCTAAGCCTATGGAAGCTAAAGAAATTGAAACCGTCCAGAAAGAAAATGTGAAACCTGATCAACCGGTGGAAGCCACAAGCACAGAACCTACGATAGTGACAGTGGAACTGAACCGCGGCTGGAATAGTCGCCTGGGGTTCAGCGTCCAGAGCCATCCGGAATCGGGCCAGAGTTACATCTCCGCTGTTTACAACGACAGCGTCGCCGCCAGAGACGGGAGGCTGCGGCGCGGAGATGTTATATTACAGGTGAACGATGAGAACGTAACTTCAATGAAGACGCCTGAAGTCATCGATTTACTTCGAATATTGCGCGGATCGATCTGCATAACGGTACTACGGCCAGCTAATGTCTGA

Protein sequence:

>DPOGS200478-PA
MADKEDTSDSDGPIIDDNVIRIQLKKPTHRKWELRTRNLSPGLKFPTIQLFDSEGNLLVETNDENISITSLDHDESPLRTSQSLREKDYKHIVRKKIIRKHNTISGDRIYTRVYRNPVHIKNNDICCKNNDCYVIKDGSCITNIPSKLIVNNNLHNLQVNSLDTTPFHSEGPSEASSYKSSDNETCHDLPTNDKLSSSNEILDDIDSTQNSINTSSISLNNIDCNNDETDVQNNVCLGLRTSQSFDSAKLPVHQKGSHNIHRSKSNAIYNIPQNKISFLDTYLKSLPARATSNPDQFTSHLRQITLETNSWKPKRIESQPFCAQKLEIYKVDDNTTENDFMNISTDVLADCEMKKRLKMYRRGLSEIESRCGTSNILPRKRRHTVSGGMQVTRWSSSSESVDEADVVLNRLKRRILKNKLREKRRSFVSKSSLDNDACKESESESQDDWRDAGGGAFVRSRLAMGSNISQHSAKGLKGRRARSSGDLCNGETKSQTESSVCGSVVGDVMGWNRSLPNHLDGNRHLADYSRVNNNYGDFGTYRSHPKGGRGLRYRVSKSGSDAEPVWKLQDPGFDQGYGSERSPEEDVQAIVPPISIEQYEAELRTVYPFITDENTFTVVVEKDGRGLGMSVCGGGGLVRIRRLYPPQPAWRTGRLAPKDLLLSANGVPLAGLSTYEALEVLRTASARVELRVCRPPADMLESITPPDPPTPPVRTPHPPHLPLDPLNCHPLHARLSQTTSSATTSSSEGRGRRDASPDTERRVQDLHLPDLDQHLPVYDIQYGEFDIVMTKVNGSLGFTLRKEDHSALGHYVRALVREPALSDGRIQPGDRIVAVNNTPMSNMSHSEAVSFLRACGSEVRLRLYRDHAATPLSPVSPKDILTDSDAPLNRPKPPLRIMTTYFIGLKQYRCYVTWHELLTEVYGVMVRDPLVSHLANIEGSPKTTTTNSPVFRTDVSNFATLETHTKRESCKRSDTERTAVERGLVLERSASTTTEMPVDNKTRYEEPRRRIALASPTAPPTCRKQKLSLTGETDSTVPPHGYELNNLDNDQLDAPNLYQENITLQRFIEPAFPVDSDEPVSMPAELSSDERFKHSSPAYQSAVLHTTTTESTSDGNKDDGNGLKKWKGVALSPDNDRKAKPPTSEIPPKPMEAKEIETVQKENVKPDQPVEATSTEPTIVTVELNRGWNSRLGFSVQSHPESGQSYISAVYNDSVAARDGRLRRGDVILQVNDENVTSMKTPEVIDLLRILRGSICITVLRPANV-
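
The transcript and protein lengths listed for this record (3798 bp and 1265 aa) can be cross-checked against each other. The sketch below is only an illustration and not part of the database: the function name `cds_lengths_agree` is hypothetical, and it assumes the transcript consists solely of coding sequence with one terminal stop codon, which matches the nucleotide sequence here ending in TGA and the protein ending in the stop marker `-`.

```python
def cds_lengths_agree(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Check that a coding-only transcript length matches its protein length.

    Assumption (not stated by the record): the transcript has no UTRs,
    so its length is exactly 3 nucleotides per codon.
    """
    # One codon per amino acid, plus one stop codon if it is included.
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values taken from the record: transcript 3798 bp, protein 1265 aa.
print(cds_lengths_agree(3798, 1265))
```

With the stop codon counted, 1265 amino acids correspond to 1266 codons, or 3798 nucleotides, so the two lengths in the record agree under this assumption.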