Monarch geneset OGS2.0

DPOGS214490
TranscriptDPOGS214490-TA2940 bp
ProteinDPOGS214490-PA979 aa
Genomic positionDPSCF300122 + 165033-174778
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0139240.074.55% 
BombyxBGIBMGA001310-TA1e-6465.00% 
DrosophilaMagi-PA2e-4133.43% 
EBI UniRef50UniRef50_F4WQS88e-9444.94%Membrane-associated guanylate kinase, WW and PDZ domain-containing protein 2 n=7 Tax=Formicidae RepID=F4WQS8_ACREC
NCBI RefSeqXP_393571.31e-9143.08%PREDICTED: similar to Magi CG30388-PA [Apis mellifera]
NCBI nr blastpgi|3227890436e-9446.19%hypothetical protein SINV_11166 [Solenopsis invicta]
NCBI nr blastxgi|3320231856e-13636.61%Membrane-associated guanylate kinase, WW and PDZ domain-containing protein 2 [Acromyrmex echinatior]
Group
Gene OntologyGO:00055152.2e-24protein binding
KEGG pathwaytgu:1002311545e-72 
 K05629 (AIP1)maps-> Tight junction
InterPro domain[855-976] IPR0014782.2e-24PDZ/DHR/GLGF
[51-89] IPR0012022.9e-12WW/Rsp5/WWP
Orthology groupMCL10670 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214490-TA
ATGGCGCAGAGTTCTGTAGATTCCGATAGACTGTCATCAGCAGGTATATCAACTCACAGTTCTGGAACTGCTGGTACAGTTGGTGGTTCAGTGGTTGGTGTTAGAGTGGGAGATTCAAATCTTAAGAGGGAAGATGAAGATGATTATAACTTGGGACCTTTACCACCTATGTGGGAGAAAGCATTTACTCCCTCCGGGGAGAAATATTTCATTGACAACAACACCGGCACATCACATTGGTTGGACCCCAGATTAGCGAGGGTTAGAAAGCATTCTCTCGGTGAATGTGGTGAAGATGAACTGCCATATGGCTGGGAGAGGGTTACGGATGAGAGATATGGTTCCTATTACGTCGACCATATCAATAGAAGAACACAATATGAAAATCCCGTGTTACAGGCTCGGAGGCTGAGAGCTGAAAAAGTGGCTGCAGAAAATGGAAATAACAATCAAGTAAATGGACCTCAACCTGTTGGGGAATTACGAGGGGAGAGATTCACTATTACACTTACTAAGGGAACTCAGGGTCTAGGTTTCACTTTGATCGGCGCTGAAGGTACACCTGAAACGAATGGCTATTTACAAATCAGAAGTATTGTCCATAACAGCCCCGCCTGGATTGATGGTGGTCTTCGGCCAGGTGATGTGGTGATGGGCGTTGGCTCCAGACTGGTCCGTGGTCTAACCCACGCTCAAGTAGCACAGATCTTCCAATCTATACCGGCCGGAACTGACGTCACACTACAAGTCGTTAGAGGATATCCACTATCATTTGACTCCGAGGATCCGAACACGCGTGTGTTGTCAACCCTAGCGGTTGATGCCAACCCTACCACGCCAGATGCAGTGGCCATGCAACAACTAACAAGAGCACTTAGAACTAGGTTTAACCTGGATTCTCCAAGTGATGGTCAAAGTCGAGACGAGCCGGACACTTCAGTTCCAGAGCACAGTCCGGACCAGAGACGGCTTATGTCTCGCTCCTTTGACCTGGATGAACTGAACAATGAGTCCAGCACTGTACCTCCCAGGTGTCCATCTGCTGATCACCTGCTCAGCTTAGTCACTGATCAACATCGCAATAACAACGTAGCAAATGAGCCAGGGAGGGAATTAGAGGTGACTCTACGACGAGGAGCCGCTGGGTTTGGATTCACCATAGCTGACAGTGTACACGGACAGAAAGTTAAAAGTTTATATTTTCTTCTCCAGGTGTTGGACCGCAGCCGCTGTGCTGGTCTCCGCGAGGGGGACCTGCTGTTGACCATCGGGGATAATGACGTCAGACACGCCCGACACCAACACGTTGTTCAGGTCCTCAAAGACTGTCCAGCTATGTTGGAGACGACGTTAAGGGTTTGGAGACGAAACACCATCACAAGAGCTCCACGAAGCAGATCCGCCACAAGAGTTCAAACAACCACTACCAACAACAACCAACTCGGCAGAAGTAAGACACCAACAGCCGAGCAGTTGAGGTCTCCGCCTACCTTACACACGGAAAGTGATATAAGAGATATACAAAACGGAATGGATTCAGTAGAAGCTACTTATGCGGTCCCAACTCATAAAAATCGGGACAACAGAGAGAGTATAATGGAACGGCGTCGTAGAAGTAGCACCCCGGGAGCTAGACTCACGCTGCCAGCCGTTACCCCCGTTACTCCTGTTACCCCATGGAACGAGTACCAAGACTGGCAACATCCGTGGAACAGGGGAACGGACAACACAAATATTATACGCAACTGGGCTGAAAATAACCAAAAATGGGATGAGAATCAAAAATGGGATGGAAATAATACTGAATATATTGGTAGGGTAGAGGGTAACGGGTGGGAGGGGAACGGGGAGCGGTGGAACAATAATTGGGGGGAAGTCCCGACTCCCACAGTGCATGTGGATATGAACGCTGGTTATTGGAGGGGAAATGCGAGTGTTGCTGACAGTAGCGATCATGGAGGATTCCCGGAGGACGGTCTCCTGCCATCCCCCGCGGGCTGTCCGCTTGGGGCGTCACCCGCCAGGCCGGGGCCCAGCTCGCATTCACCAGACAACTCTACAGGTCGTCTCCGGTCTCACTCTCCGTCCACAGCGAGTGCCACAGACCGTCTACTGTCTTCCTCTCCCTCCAGGCTTCAAAGCCAGTCACCTTCAAACCATTCCAGCAGCAGTCGTGTTAAAGGCTCTCCTTTACGGTATATAAATTACTTAGTGGAAGCAGGTGGCAGCACAGACCCGGAAGCGGATGTCCTGGTCCGTCTGGCTCGCAGCTCAGCCGGATTCGGCTTCAGGATAGTAGGAGGCACTGAGGACGGCAGTAGAGTCGCCGTGGGATACGTTGTACCAGGGGGTCCAGCTGATGGTTTGCTACGACCTGGTGACCTTTTGACCTCTGTGGATGGGATCCCACTCGCGGGGGCCACGCACGCCTTAGCAGTGGCATATGTTTGCCAAGCCGCCTCCAGGGGTCATGTTACGCTCGGCGTGAGACACAACCGAGCCGACATACAGGCGCTAGGACTACCCGCGTTACCGACACAACCATTGCTAGCCGAGCCCCTGCACGGCCCCACATACAACATCCTGCACCCGGCGCCGTTCTACTCTTCCAATACAGCCATTCCTAGGTATGGGGTTCAATATGACACTAACGCTGGTTGGTGTTCGGCGCCTTATGACGTCACGGTCACCAGGAATGACGGAGAGGGGTTCGGGTTTGTGGTCATATCGTCCACAAACAAAGCTACCAGTACTATAGGTCAATTGATACCCAACTCTCCAGCTGCTCGTTGTGGTCGTCTCCGTGTAGGGGACACCATAGTGGCCATCAACGGCACCGCAGTCCGCGCCCTGCCCCATCCTGAAGTTGTGTCACTCATCAAGCGATCCGGGGCATCAGTCACATTGACCGTAGCCCCGCCTGACCTGCGACACGACTGA

Protein sequence:

>DPOGS214490-PA
MAQSSVDSDRLSSAGISTHSSGTAGTVGGSVVGVRVGDSNLKREDEDDYNLGPLPPMWEKAFTPSGEKYFIDNNTGTSHWLDPRLARVRKHSLGECGEDELPYGWERVTDERYGSYYVDHINRRTQYENPVLQARRLRAEKVAAENGNNNQVNGPQPVGELRGERFTITLTKGTQGLGFTLIGAEGTPETNGYLQIRSIVHNSPAWIDGGLRPGDVVMGVGSRLVRGLTHAQVAQIFQSIPAGTDVTLQVVRGYPLSFDSEDPNTRVLSTLAVDANPTTPDAVAMQQLTRALRTRFNLDSPSDGQSRDEPDTSVPEHSPDQRRLMSRSFDLDELNNESSTVPPRCPSADHLLSLVTDQHRNNNVANEPGRELEVTLRRGAAGFGFTIADSVHGQKVKSLYFLLQVLDRSRCAGLREGDLLLTIGDNDVRHARHQHVVQVLKDCPAMLETTLRVWRRNTITRAPRSRSATRVQTTTTNNNQLGRSKTPTAEQLRSPPTLHTESDIRDIQNGMDSVEATYAVPTHKNRDNRESIMERRRRSSTPGARLTLPAVTPVTPVTPWNEYQDWQHPWNRGTDNTNIIRNWAENNQKWDENQKWDGNNTEYIGRVEGNGWEGNGERWNNNWGEVPTPTVHVDMNAGYWRGNASVADSSDHGGFPEDGLLPSPAGCPLGASPARPGPSSHSPDNSTGRLRSHSPSTASATDRLLSSSPSRLQSQSPSNHSSSSRVKGSPLRYINYLVEAGGSTDPEADVLVRLARSSAGFGFRIVGGTEDGSRVAVGYVVPGGPADGLLRPGDLLTSVDGIPLAGATHALAVAYVCQAASRGHVTLGVRHNRADIQALGLPALPTQPLLAEPLHGPTYNILHPAPFYSSNTAIPRYGVQYDTNAGWCSAPYDVTVTRNDGEGFGFVVISSTNKATSTIGQLIPNSPAARCGRLRVGDTIVAINGTAVRALPHPEVVSLIKRSGASVTLTVAPPDLRHD-