Monarch geneset OGS2.0

DPOGS211732
Transcript: DPOGS211732-TA | 4206 bp
Protein: DPOGS211732-PA | 1401 aa
Genomic position: DPSCF300239 + 187142-249554
RNAseq coverage: 114x (Rank: top 59%)
Annotation
Heliconius: HMEL017339 | 0.0 | 64.02%
Bombyx: BGIBMGA013978-TA | 0.0 | 70.00%
Drosophila: Patj-PA | 7e-47 | 40.06%
EBI UniRef50: UniRef50_D6WPR0 | 7e-105 | 41.72% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WPR0_TRICA
NCBI RefSeq: XP_969924.2 | 8e-106 | 41.72% | PREDICTED: similar to GA11344-PA [Tribolium castaneum]
NCBI nr blastp: gi|189239280 | 2e-104 | 41.72% | PREDICTED: similar to GA11344-PA [Tribolium castaneum]
NCBI nr blastx: gi|270010409 | 6e-103 | 33.08% | hypothetical protein TcasGA2_TC009801 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515 | 2.4e-22 | protein binding
KEGG pathway: cfa:474708 | 4e-29
  K06095 (MPDZ, MUPP1) maps to Tight junction
InterPro domain: [23-136] | IPR001478 | 2.4e-22 | PDZ/DHR/GLGF
Orthology group: MCL26156 | Lepidoptera specific

Nucleotide sequence:

>DPOGS211732-TA
ATGGACCCTCTTATCGCCAATTGGATGTTTCGTCCAGAGATCGTTAAGGTGCTTCAATGTGCGTGCGTGTATAATGTGTGCGTGAACGGGTTATGTACTGATGTTATTATAAATGTACAGCAGGTGTTCGGCGAGTGGGCCCAGGTGGAGGTTGTGGAACTTGTGAATGATGGATCCGGGCTGGCTTTTGGTATTGTTGGAGGCCGTTCCTCGGGAGTCGTCGTCAAGAGTGTCCTACCGGGAGGAGTCGCTGATAGGGACGGCCGTCTCCGTAGCGGCGATATGCTGTTACGTATTGGCGCTGTTTGTGTTGTTGGTATGTGCGCTCGTCAGGCTGCTTCTGTGTTGCGGCAGTGCGGTGCCTGTGTCCGGCTGCTGGTGGCTAGACCAGCGCATGGCGCTTTGCAGGCGTCTCTGGGTGCGTCACCAGCGCCGGTTGTGCCTTCACGGTTGCTGACTGATCCGATAGAGTTGGAGCGGACTCTGGCTGAGGCTGGTCATGATGGGTTTGGGGCACTCTGCGAGGACACCACACCGACATCACCACCACCTACCATCATCGAGGAAAGACTTCATCATCGGCCGGTGGCGTCTATAGTGGCGGTGGTACCACGAGGGGCGCTCGCTCCGTCTCCGCGACCGCCACCACTCATAACCACCTCGGAGCCACAACCACCACCATTGCGGCTTCCTTTAGATATGGCAATACGGGGATCCGAGCCTGAATCTATCAGCTACGAGGTGGAATTGAATAAGGACTCCGCTCTAGGACTCGGCATAACAGTCGCAGGATATGTTGACGGTGTCCCTTTGCATGGGGTCACAAATCACAGGGCTGTGGCCTTACTTAGGGATGCGAGGGGTCCTATTGTTAAATTAAAAGCCTTGAGATATCTTCGTGGTGCGGGTTTCGAACGACTTCAAAAGGCGCTGGCGGCTCAAGACGGCCGGCGCTCACCGATCGCACCCCCAAGCCCCTCAGTGACATCTCTTGCCAAATATAGCTTTAGTGTGTACGACGAATCAACGGTGATTGAAGCGGAGCCGGAGTCAGAGGTCCAAGGTCACCTTCCATCACCGACTTCCCCCGAGGATGCGGAGATCGTGATCGGCAGAGATGATGACATCTCAGAGAGAGAGGTCAGGAGAATACAGGACAAGTGGCGGGATGTCCTGACTGAGGAACCTGATTGGCCAGGACATGACCAGCCATATGATATCGTAGTAGGAACTATAATGAGAGAGCGGGATAGCGGCTTGGGGATTTCTTTAGAGGGCACGGTGAGGGTAGTTGCGGGGAAGGAGGTCCAAGCTCGCCATTACATCAGAGCCATCGCACCAAATGGACCAGTTGCTCGGATGGGTCTTTTTAGGGTAGGCGATGAATTGTTAGAGGTGAATGGTTGGCGTGTTTTGGGCGCTCACCACGTGGAAGTCGTCACCAGACTGAGGGCGGTTCGATCTCCAGCCGTTCTTGTGGCAGCAAGACGATTGCCTAAAACTCCGACGGTGCTGGGTGGAAGTCTCCAGGAACTACTAGCTCCTCCTCAGCGTCTCATTAAAGCCAAGTCTGAGAGTTCAGTGTCTTCACTGTGCAGCGCCAGCACGGTTCTATCCGGACATCATCACGACGAATTCTGCTCCGACCTTGAGCGTAATAGGTCAAGGTCGCTGGAGCCGCTCAGTGGGCTCGCCATGTGGAGCGATCAGGTTGAATACATACAACTAATGAAGGAAGACAGAGGACTTGGCTTCTCGATCTTGGATTATCTGGTGAGCATAGTGACCATTGGTATATCGAAACCTCTTCCTTGTAGCACCATCGTGGGTGGCGAACGAGCCAACGGACAGAGCGATCAGGGCAGTCAGGAATGCATTAATTCGTTAGCATCCGACGATCATACTGGGATGGATTCCTTCCACGAATGCTTACAGGAATACGGTGAAGGTTTGGAAGTTGTGCAAATATGTCTGGAACCAGAAGATGAATCTGCTAAATA
CGAGGACTTGTCCGCTTTGGCTGACGTGAGTTACGATGATTGTAAGAAAGACACTATCAGAGACGATGACTTTATAACGGAAGTCAAAGATAACAAACAAGTTAACGGCGTTATGGAATTTAAATCGTCCAAATCTGATGGATCCTTGGACGATCCAGATGATGACATGACCATAACCGAGGATGACACTTTACACACGCCGCGTGAGCTGACTATGAGAAGTAAATCAGAAGACAGGAAAGATAAAGATCGGGGGAGAGGTTTGTCAAAACAGCAGAGCAAAGACTTATCAACGTCGGACGAGATGGTGTTCGCAGTTACACCGACAGGTCTAGAAAGAATCAGCTTTGAAAAATACTGGAAGGACGTTGATGTGAGCAAAAGCAAAACACAGAACGACGATAGTTGGAAAGAATGTCCCAATTCCCCATCCGACGATACGAGCTTCTACGATGCGACCATGAGAATATCTGTACAGGAACAAAACAATTCCCTACTGTACATTGACCATGAGATCGATGGGTATGAGACATGCGTAGATGATGACTCTTCGTCTGTTAAGTGTCCAGACAACAGCAGCTATAAAATTAACGGAGAGAGCGAGATCAAAGAATTAGATAAACACGATAAAATACACTTTAATAACAAAGACAACGATGTTTGTGCCAAAACTGAAGACGATCAGATCACTGCCAATGCCGTAGACGATGTGTCGTCTTACAGCCATCCGTATATAGAAGAACATATTATTAAACAGATGAAATCGTTAAGAATAGAGCCTTTAAGTATACACATAAATAATAAAAAAATAAAGAAGAAAAGTCCATCGAAACCGTCCAAGAGTGAGCGTCGCGTTATCGAATATTACAACGACCAAGCTGAATGCTTAAAAGAGATAAGACAAAAGAAACTAGAAGAAGAAGAAAAGAAGTACAGGGAAGAGCATCAGATAAGTGAATGTTATAAAATAGCTGAGAAGAAAATGCATTTGAAAAAACAACTGCCAAGCGAAGAATCGACAGACGTTGAACCTTATGTGTCCGGATGTCATAAATGTTTTTTAGAGAACTACGCTTACGAAATAACGAAGGCTTATATAAAAGAAGCGAGCACTTGTAAATATTGTGAGGTGATTCGTGATATGTTAGAAGGGCCTAAGGTTATGCCGAACAGAAGGATGTCTGCTCCAGCACTGTTCAATCTGGAGAGTATAGATACAGACACTGATGATGAAGATCTTCTGGCTGTTCCTGTAGTCAAAGATAGGAGGAAGTCCACCGGTCCATTAAAGTTTCCAGTAGCTGGCAGAAGATCGAGGAATTGTATTTGTGTACGATATTGTGCATCCAGTGCAATGAACCGGTCTAAATTATTCACTCAGCTGAGCACACTAACTCGGAAATTGATGTTGAATTATACACCATTGACTGAATTAAAACTCAAACATATCCGCCCTGATAGATATGTGGATATTATTTCGAAGCAAAATGGCGAAAGCGGTGGTGAAGAGAAGGCGATCTTCGGTATTTTTATTAAAAATGTAGTGCCCAATAGTCCGGCAGCGCTTTGCGGGGAACTACAAACAGGAGATAGGATCTTGGAGGTGGATGGTGTGTGTGTCCGAACAGCGCAGCATGAACGAGCTGTGGAACTGATCAAGGCTGCCAGGGAAGTCGTTGTTCTTACTGTGCAAAGCCTTCTGACTTGGAATACAGACTCGTCTGATGTAGAGGCCTCCCCTGCCACGTCTCCACCGAGGCCCGTCAAGAAAGCACCAGCTCCTAAACCACCTCAGCATGAAATAAAGATAATGGTAACCGAACCAGATGTGGAAAGAAAACAAGCGAAAGAAGAAGAGAAAGAGGAAAAAGGGAAAGAGAATGAGAAACAAACAGAGAAACAACCAGAGAAAGTGAACGGAGAGGTTCAACCGAAGAAAGTTTACTCAGATTCAGAAAGCAGTGACGAAGAAGACGAAAGAGAATTACAAGGAAGGACTT
ATTCCGAAAAAGGCGTTGAGATTGACCGAGCATCAGCTGGTGCTATTAAACGCAGTAGAGAAGAGAAGGAGGCTGATCCAGAGGAAGAAGACGACTTCGGTTACACCACAAGTAAGTGGAAAGATACATTATTCACGTTGTTTAACTTCACGTTTTATTTTCTCCACGTATATTTCTACGCCTGTATCGTAGAGCACAGCCGATAG

Protein sequence:

>DPOGS211732-PA
MDPLIANWMFRPEIVKVLQCACVYNVCVNGLCTDVIINVQQVFGEWAQVEVVELVNDGSGLAFGIVGGRSSGVVVKSVLPGGVADRDGRLRSGDMLLRIGAVCVVGMCARQAASVLRQCGACVRLLVARPAHGALQASLGASPAPVVPSRLLTDPIELERTLAEAGHDGFGALCEDTTPTSPPPTIIEERLHHRPVASIVAVVPRGALAPSPRPPPLITTSEPQPPPLRLPLDMAIRGSEPESISYEVELNKDSALGLGITVAGYVDGVPLHGVTNHRAVALLRDARGPIVKLKALRYLRGAGFERLQKALAAQDGRRSPIAPPSPSVTSLAKYSFSVYDESTVIEAEPESEVQGHLPSPTSPEDAEIVIGRDDDISEREVRRIQDKWRDVLTEEPDWPGHDQPYDIVVGTIMRERDSGLGISLEGTVRVVAGKEVQARHYIRAIAPNGPVARMGLFRVGDELLEVNGWRVLGAHHVEVVTRLRAVRSPAVLVAARRLPKTPTVLGGSLQELLAPPQRLIKAKSESSVSSLCSASTVLSGHHHDEFCSDLERNRSRSLEPLSGLAMWSDQVEYIQLMKEDRGLGFSILDYLVSIVTIGISKPLPCSTIVGGERANGQSDQGSQECINSLASDDHTGMDSFHECLQEYGEGLEVVQICLEPEDESAKYEDLSALADVSYDDCKKDTIRDDDFITEVKDNKQVNGVMEFKSSKSDGSLDDPDDDMTITEDDTLHTPRELTMRSKSEDRKDKDRGRGLSKQQSKDLSTSDEMVFAVTPTGLERISFEKYWKDVDVSKSKTQNDDSWKECPNSPSDDTSFYDATMRISVQEQNNSLLYIDHEIDGYETCVDDDSSSVKCPDNSSYKINGESEIKELDKHDKIHFNNKDNDVCAKTEDDQITANAVDDVSSYSHPYIEEHIIKQMKSLRIEPLSIHINNKKIKKKSPSKPSKSERRVIEYYNDQAECLKEIRQKKLEEEEKKYREEHQISECYKIAEKKMHLKKQLPSEESTDVEPYVSGCHKCFLENYAYEITKAYIKEASTCKYCEVIRDMLEGPKVMPNRRMSAPALFNLESIDTDTDDEDLLAVPVVKDRRKSTGPLKFPVAGRRSRNCICVRYCASSAMNRSKLFTQLSTLTRKLMLNYTPLTELKLKHIRPDRYVDIISKQNGESGGEEKAIFGIFIKNVVPNSPAALCGELQTGDRILEVDGVCVRTAQHERAVELIKAAREVVVLTVQSLLTWNTDSSDVEASPATSPPRPVKKAPAPKPPQHEIKIMVTEPDVERKQAKEEEKEEKGKENEKQTEKQPEKVNGEVQPKKVYSDSESSDEEDERELQGRTYSEKGVEIDRASAGAIKRSREEKEADPEEEDDFGYTTSKWKDTLFTLFNFTFYFLHVYFYACIVEHSR-
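The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon; the names translate and expected_protein_length are hypothetical helpers and not part of any MonarchBase tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, mirroring the "-" that
    ends the protein record (an assumption about that notation).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First 18 and last 12 bases of DPOGS211732-TA, copied from the record.
print(translate("ATGGACCCTCTTATCGCC"))   # MDPLIA
print(translate("CACAGCCGATAG"))         # HSR-
print(expected_protein_length(4206))     # 1401
```

The translated ends agree with the start (MDPLIA) and end (HSR-) of DPOGS211732-PA, and 4206 bp corresponds to 1401 residues plus one stop codon, matching the lengths listed for the transcript and protein.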