Monarch geneset OGS2.0

DPOGS215277
Transcript         DPOGS215277-TA      6108 bp
Protein            DPOGS215277-PA      2035 aa
Genomic position   DPSCF300047 + 549143-561649
RNAseq coverage    563x (Rank: top 22%)
Annotation
Heliconius         HMEL012721          0.0      83.03%
Bombyx             BGIBMGA009085-TA    0.0      90.41%
Drosophila         CG6509-PA           3e-95    46.72%
EBI UniRef50       UniRef50_E0VGQ1     0.0      39.82%   Discs large protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VGQ1_PEDHC
NCBI RefSeq        XP_002425295.1      0.0      39.82%   discs large protein, putative [Pediculus humanus corporis]
NCBI nr blastp     gi|242009028        0.0      39.82%   discs large protein, putative [Pediculus humanus corporis]
NCBI nr blastx     gi|340712102        0.0      38.76%   PREDICTED: disks large homolog 5-like isoform 2 [Bombus terrestris]
Group
Gene Ontology      GO:0005515          5.2e-26  protein binding
KEGG pathway       dre:565446          2e-34
                   K06098 (TJP2, ZO2)  maps-> Tight junction
                                               Vibrio cholerae infection
InterPro domain    [1214-1331] IPR001478   5.2e-26  PDZ/DHR/GLGF
Orthology group    MCL11677            Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215277-TA
ATGGCATCTGGCACATCATCTCTAGAGAGTAATGGGAGCAATGAGTCCATATCCCACAGCTACGTAAGCAATGAAAGACATAACACAACTCAGCTGCCGACAATACCTATACCTATTGAATATGAAGTTCTATCATCGTCCCAGTGCTACGACAAGCCCAACTACACCAGCCATACTCACACCATGAGTGCTCCGTATGAGCTGGGGCAGTATGACGTCATCAATAAACAAGCTAATGAGAATTTGAAGCAGCAGTGCGAGAAGGCCCTGCATGATTTGAACCAGTTACGACGACAGCACACCGAGACATCCCGGCGCTGTGAGCATGTTATGAAAGAGTTGGAATACTTCCGCGGCCAGCATCGAGCGGCGATGAACCAGCTCGAGGTGTCCGCTCAGGAGGCGAGCAGCCTGCGCGGCAAGTACGGGGACTTGCTCAATGATAACCAACGACTGGAGCGCGAGGTGCAGGTGCTACAGGGCGGAGGCGCCGAGGGCGGGGAGGCTCTCGTACACGCGCTCAGAGACTTCGACGCGCTCAAAGAGGAATTTGAATGTTTTCAGAAGAGATATGATGACCTCATCGAAAACCACAACACAACCTTGGAGAAGCTTCAGATTGTTCAGGAAGAGAATAGTAGGCTGAAGACACAATGCCATGACCTCACACAGGAGAGGAATTCAGTTATCCGGGAGCGAAACGCCCTGAAGCAGCAGGTCGCCAGCGTGGTGCGGCAGTGGGAGGGCGGCGTGCGGGAGAGAGCGGACATCAAGCGGCTCACGGACGAGAGGAACGCCGCCATGGCCGAGTACACGCTCATCATGAGCGAGAGGGACACGGTTCATAAGGAAATGGAAAAGCTATCTGATGATCTGCAAGCCGCTTTAAAGCGGGTAGCGGCGCTGGAGTCAGCGCTTCAAGCGTCCCGCGACGAACAGCGAGCTACCGCACTGCAGGCCGAGAGTCTCCGTCGTGAGATCGAGTCAGCTCTAGCCGACCGCGACCGCGCCATCAAAGAGTGCACCGACCTTAAGACACCACAGAACACCAACACTCTGGGGAAGTCGCGGCAAGAGAACTCTGTATATCATCACCTGGGACTCTCGACTGGTGTTGGCACCGACGGCTTCTTGAACGGACAGCATTCCAATGATCCGTCTATGGATAAGTTGCAAGGTCTCGTGCTGGGTGAAGGCGACAAGGCCCAGTCCGACAACCTGGAGCAGGCCAACCAGGAGCTGCAGCGACTCAGGGCTCGCTCACACCGCCTGCAGGCCGAGCTGGCGGACGCGCTGAGGGAGGCCGACGTGGCTCGCCGGAGGAGGGACTGGGCCTTCTCCGAGAGGGACAAGCTGCTGCAGGAGAGGGAGAGCGTTAAGGCTCTGTGCAATAGACTGAGAAAGGAGCGAGATCAGATGGTGGGCGAGCTGGCGGAGGCGAGGCGACACGGAAACAACAAGAAGGAGACCAAGGACAAGCCGCGAGCCAGGTCACACGACGACAAGTGTCGCCTGGCCTGGTACAGGGACTACGGGGACTTGCAGAAGTTGGAGTGGGAGGAGTTGGAGGTGGAGCTGGAAGGCAGCCCCGAGGCCTTGGGGTTAGAGTTCTGTGGAGGGAGAGAGGCTGGGACCGACCAGCACGTTTACGTCACGTCCGTGCGGACAGGCTCGCTGGCCGACGGGAAGATACTTCCTCTAGATCGTATCACGGAGGCTTGCGGGGGGTCGGGCTGGTCCCGGGGCGGAGTGCTGGCGGGCTGCCTGAGGGCGGCGCTGTCCCGGGGCGGAGCCCTGCTTAGGCTCCAACGAGCGCGTGCAAGGAGAATTGACATACGACTGGCGCCACGGGACCACGCGGCGCTCACACTCTCACACGGTATTTTCATCAGCAAAATAAGTCGCGGCAGCGCCGTCTCGAAGGAGGGAAGTCTCCAGGTCGGAGACAGGGTCGTGAGCATCAATAATCGTTCTCTGGAGGGCGTCAAGAGTGTGTCGGA
AGCGCTATCTCTGTTAGACGACAGCGCGTACGATGGCGCCACGCTCACCGTGCTGAGAGCACTGCCGACCTTACAACGACATTCCGGCGACGGAAACAACAACACGCGCTCACTGAGACATTGTTATGTGACCAGGGCTCCCGACTGTGACGGGGACTGCAAGGACGAGACCTGCGCCTTCCTGCCGCCGCCCGACACTAAGATCCACACAGAGCACGGCGTGCCGGATTTTGATAGACGATACGTGTACCACGTGACCGCCGGCAGCCAGGGGAAGATCTCCCAGGACAGGGGCGCGTGGGACATGCTGAGGGGGAAGATAGAGGCGGTCACCAGCAAGGGGAAAGACAAACAGAAGGTGCCCGAGTCGGACGCCATCGCCGAACTGGACTCCGTCATAGACAGCTATCACGGGAAGACAATAAAAGAGACGAGCAGCGTGTTGAAAAGATCACGACGAAAAAATAAAGAAGCACAGAATGATGCTAAGAATGGCGGCACGTGGCCTAAGGCGCGGGCGAACTTTGTGCTAGAAGACAGCGCCACGGGGACCATAGTGCAGCCAAGGTCCAGGAAAGAGAGACCGCCGCTGTCCATCCTGCTCAGTCCTCCGACTAAACTGCCGCCCGAACCTCAGAGGTCCAACACGAACCGAAACTCGAACCCCATACCTCTGGGACACGTTCCATTGACTCAAGTGACCACGCCCGCACGACATTCGGTTTATAAATCTATAGAATCGAGTCCCGCTGTGGAACATTTTCCGAAATCGGCTCCCTTAGCCATACATAACCCCTTCGCTCCGCCAAACATGAGCTTCGAGAAGAATCGCACTCTGGAAAAGGACAAAATATCTATCAGTGACTACGAACAGGACGTCACTCCGAACAGATTAAGTCTAGTATTGAACCCGTCAGAAGACTCTCTCTTCTACCAACCGGGCAGGAACCGCACTAAAAGTCCTCACTCCTTGGACTTCATGGTGAACAAGGGTCCTAATTCCCTGGACTTCGTGTCTAGAAAATCACCGAGTTCCCTGGACCATACAATAAAAAATCATACGGGCAAAGATATACTCGACTCTTACTATTCATCGAAGAAGTCATCGTCACCGAAGGCGGTGAATAAATATCCATCGGACAGCGACAGCTTCGGGCCCGACAGCTTGAACAGCAACGGGAATTTCCCGATGACCGGCACGCTGCCCTCACAGTCGCGGCTACAGTCACAATACTACAGAGGTCCTTTCACAAATTCGAATCCACACTCTTCGAGTCGCTACACGCATCTGTCGAGTCCAACCAATATTCCCCAAAGTCAATCCGGGGAGAGCATCGGCACGAGCTACGACATGCACTCGTTCACATCCCACACGCACAACATGTCTGACCTACATCCAGTTCCGAGGGTCAACAGGGACTATCATCACACATACGAGGGCGGGACTTTTCCTAGGAAAAAAGAGAACCAGCGATTCAGAATACCGTCTAACCCGAGTGTGACGTCGAAGAATTCTGCGGGAAAACTCAGCACGGGCTCCATAGAACGGAGCTCCGAACGAAATTCTCCGATGCCGACTTTTCATGTTGAAGTTTTAAGTCCGGGACGAGGTGCAAAACAAATGGCTCATAAAAGTACTAGAAATTCGATGCCGGAATATTGCGGGCTGGGCTGGAACAAGCCGTTGCCGGGAGAGCTGAGGAGGGTGCACATCGATAAGAGTCAGCAGCCTCTAGGCATTCAGATATACTGTCCGCCGAGCAGCGGGGGCGTGTTCGTGTCCACCGTCAACGAGAACTCCTTAGCCAGCCAGGTGGGGTTGCAGGTTGGAGATCAGTTATTGGAAGTGTGCGGCATCAACATGCGCTCAGCGACGTATACTTTAGCAGCGAGCGTATTACGTCAGATAGGAAATTCCATAACGATGTTGGTGCAGTTCAGTCCAGAGAAATACAAGGATGAAATGGAGGTCCCGGGAAGTTCTTCATCCGGGGAGAGTT
CTCACGACGAAGAAGTTTCCTTATCCGGGTCTCCAACACCCAGGAACTCACCCGGACCTCAGCAATCATTACGTATGGACGTGCCATCAACAGACGCGGCTACACTGCGACAGTCGCAGGGTAATAAAGATCTTCCCAGGTTTCTCTTAATAGAAATGCGCAACTGTTCAGATTTGGGCATCAGTCTTGTGGGTGGCAACGCCGTGGGCATATTCGTGCACAGCGTTCAGATGGATTCTCCGGCGTATATCGCGGGATTGCGGACAGGAGACCAAATCTTGGAATACAATAACGTGGACTTAAGACGAGCCACCGCTGAACAAGCCGCTCTGGAACTGGCGAAGCCAGCGGACAAGGTGAGCGTGTTAGTCCGACACGACCTCCAACAATATCACGAAATAAAAGACAAGCCTGGCGATGCGTTCTACATACGAGCGGGTTTCGACCGCTGTGCTAAGATCAACTCCATAGGCATGGACACCATAGATGAATACTCGCTGTGGTTCAGAAAAGACGAGGTGCTCTTCGTAGATAACACACTGTTCAATGGAGCTCCGGGTCTGTGGCGCGCCTGGCAGTTGGACTGCAAAGGAGTCAAGCGACATTGGGGTACTATACCCAGCAAGTTCAAAGTGGAAGACATTTTGCGTAGATCTGTCGGCACTTTAGACAACGAAGCCCAACGACGAGCTTCCAGTACAGCACGGAGGTCGTTCTTCCGCCGCAAGAAACATAGGGACAGTAAAGAGCTCGCGTCTTTCTCAAACACGGAGCTGGGATGCTGGAGCGACTCCGGGACCCTGGCCGATGACGCACCTCCACTGTCCTATCAGAGAGTGGAGCGGCTCCATTATGGGTCTCGGCGGCCGGTGGCGGTGCTGGGTCCTCTGCGGGAGTGCGTGGCCGGTAAGCTGGTCACGGACTGGCCGCACGTGTTCGCACGCGCGAGGCCGGACCCACGAGCTCTGAGGGACTTGGCTGATAAGGGTATTCACTGCATAATAGACGTCAGCGTGCCGACGATAGAGAAGCTTCACAAGCATCAGATATACCCTATTGTTCTGTTCATCAAGTTCAAGTCCTTCAAACAGATCAAAGAAGTCAAAGACACGAGGTATCCCGCCGATAAAGTGTCGGCCAAAGCCGCCAAGGAGATGTACGAGCATGGATTGAAAGTCGAAAACGAATATAGGAATTATATATCGGGAGGCGGTGGAGGAGGAGCAGAACAAGACGCTGTGGGTCCCCTCTGCACAGGCCTGACCCCTGCCCCCCACGCCGCCGGCACCCGACCCGCTCCCCGCCCTCGCACCCCCGCCCCCGCCCCCCGGCACAAGTCAGTGACCTTCTGTGCACACTGTAAATACGACCGGCAGCCGGCGCCGCTCCCTGACACGGACGTCAGATACTACTCCGACCCCGGACTTCCCGAGCCGCCCAAACAGAAGACCAAAAAGAAACCCAAAGAGACCAAGTACCACGACAAGATGGATGCTGACCTCAGATACTACTCGGAGCCGGATGCCAAGTACTTCGAGATAGACTACGTGTTCCTCAAGGAGCTCAGGAAGATCGCTCGCAACAACTGTCCCAAGTGTAAGAGGAAGAGGTCGAGGAAATACGAAGGTAAGACCAAGGAGAGGCGACAACGGGGCGTCCGGCTCAGGGAGAACTACGTGCTCTGTAATGGAAGGTACCACAGCGAGATGGAGCACTGCCAGCTGTGCTGCAAGATCTGCAACCGCTCCACGGACACCCTGGACTCGTTGGACGATTACTCGTTGGCGTCCAAGAGCTACAGCCTCCCTTCTAGCCGCGCTACCACGAGTTCCCGATCCAAGTCCACACCCAGCCGTCGCGACAGAAAGAACTCGATCCAGTCGAACAAATCGGTGTCTTTTCTCGAATCCAGTAACGAGGAGCTGAAGGACGAGCCCAACTACACCGCCACGAGTTTCACGAACGATCTCAAGAAGTTCCTCCTCAAGCCGTCGTCGCCG
AGGCTCAGCCTGCGGGAGAAGTTCATCATCAGCTTCAAGCAGGAGCTGGAGGCCACCAAGAGGTCGCCCAAGCTGCGAGCCATCAAGGGCCTCTGGAAGTCGTACTGA

Protein sequence:

>DPOGS215277-PA
MASGTSSLESNGSNESISHSYVSNERHNTTQLPTIPIPIEYEVLSSSQCYDKPNYTSHTHTMSAPYELGQYDVINKQANENLKQQCEKALHDLNQLRRQHTETSRRCEHVMKELEYFRGQHRAAMNQLEVSAQEASSLRGKYGDLLNDNQRLEREVQVLQGGGAEGGEALVHALRDFDALKEEFECFQKRYDDLIENHNTTLEKLQIVQEENSRLKTQCHDLTQERNSVIRERNALKQQVASVVRQWEGGVRERADIKRLTDERNAAMAEYTLIMSERDTVHKEMEKLSDDLQAALKRVAALESALQASRDEQRATALQAESLRREIESALADRDRAIKECTDLKTPQNTNTLGKSRQENSVYHHLGLSTGVGTDGFLNGQHSNDPSMDKLQGLVLGEGDKAQSDNLEQANQELQRLRARSHRLQAELADALREADVARRRRDWAFSERDKLLQERESVKALCNRLRKERDQMVGELAEARRHGNNKKETKDKPRARSHDDKCRLAWYRDYGDLQKLEWEELEVELEGSPEALGLEFCGGREAGTDQHVYVTSVRTGSLADGKILPLDRITEACGGSGWSRGGVLAGCLRAALSRGGALLRLQRARARRIDIRLAPRDHAALTLSHGIFISKISRGSAVSKEGSLQVGDRVVSINNRSLEGVKSVSEALSLLDDSAYDGATLTVLRALPTLQRHSGDGNNNTRSLRHCYVTRAPDCDGDCKDETCAFLPPPDTKIHTEHGVPDFDRRYVYHVTAGSQGKISQDRGAWDMLRGKIEAVTSKGKDKQKVPESDAIAELDSVIDSYHGKTIKETSSVLKRSRRKNKEAQNDAKNGGTWPKARANFVLEDSATGTIVQPRSRKERPPLSILLSPPTKLPPEPQRSNTNRNSNPIPLGHVPLTQVTTPARHSVYKSIESSPAVEHFPKSAPLAIHNPFAPPNMSFEKNRTLEKDKISISDYEQDVTPNRLSLVLNPSEDSLFYQPGRNRTKSPHSLDFMVNKGPNSLDFVSRKSPSSLDHTIKNHTGKDILDSYYSSKKSSSPKAVNKYPSDSDSFGPDSLNSNGNFPMTGTLPSQSRLQSQYYRGPFTNSNPHSSSRYTHLSSPTNIPQSQSGESIGTSYDMHSFTSHTHNMSDLHPVPRVNRDYHHTYEGGTFPRKKENQRFRIPSNPSVTSKNSAGKLSTGSIERSSERNSPMPTFHVEVLSPGRGAKQMAHKSTRNSMPEYCGLGWNKPLPGELRRVHIDKSQQPLGIQIYCPPSSGGVFVSTVNENSLASQVGLQVGDQLLEVCGINMRSATYTLAASVLRQIGNSITMLVQFSPEKYKDEMEVPGSSSSGESSHDEEVSLSGSPTPRNSPGPQQSLRMDVPSTDAATLRQSQGNKDLPRFLLIEMRNCSDLGISLVGGNAVGIFVHSVQMDSPAYIAGLRTGDQILEYNNVDLRRATAEQAALELAKPADKVSVLVRHDLQQYHEIKDKPGDAFYIRAGFDRCAKINSIGMDTIDEYSLWFRKDEVLFVDNTLFNGAPGLWRAWQLDCKGVKRHWGTIPSKFKVEDILRRSVGTLDNEAQRRASSTARRSFFRRKKHRDSKELASFSNTELGCWSDSGTLADDAPPLSYQRVERLHYGSRRPVAVLGPLRECVAGKLVTDWPHVFARARPDPRALRDLADKGIHCIIDVSVPTIEKLHKHQIYPIVLFIKFKSFKQIKEVKDTRYPADKVSAKAAKEMYEHGLKVENEYRNYISGGGGGGAEQDAVGPLCTGLTPAPHAAGTRPAPRPRTPAPAPRHKSVTFCAHCKYDRQPAPLPDTDVRYYSDPGLPEPPKQKTKKKPKETKYHDKMDADLRYYSEPDAKYFEIDYVFLKELRKIARNNCPKCKRKRSRKYEGKTKERRQRGVRLRENYVLCNGRYHSEMEHCQLCCKICNRSTDTLDSLDDYSLASKSYSLPSSRATTSSRSKSTPSRRDRKNSIQSNKSVSFLESSNEELKDEPNYTATSFTNDLKKFLLKPSSP
RLSLREKFIISFKQELEATKRSPKLRAIKGLWKSY-
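
Consistency check:

The lengths reported in this record can be cross-checked against each other. The sketch below is an illustration and not part of the record itself. It assumes that the transcript entry is the coding sequence only (start codon through stop codon, which is consistent with the ATG...TGA ends of the nucleotide sequence and the trailing `-` in the protein sequence) and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
# Values copied from the record for DPOGS215277.
TRANSCRIPT_BP = 6108
PROTEIN_AA = 2035
GENOMIC_START = 549143
GENOMIC_END = 561649


def cds_length(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Length in bp of a coding sequence that encodes `protein_aa` residues."""
    codons = protein_aa + (1 if has_stop_codon else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # 3 * (2035 + 1) = 6108, so the protein and transcript lengths agree.
    print(cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    # Size of the locus on scaffold DPSCF300047 under the coordinate assumption.
    print(span_length(GENOMIC_START, GENOMIC_END))
```

Under these assumptions the locus spans 12507 bp, of which 6108 bp are coding; the remaining 6399 bp would presumably be intronic.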