Monarch geneset OGS2.0

DPOGS214049
TranscriptDPOGS214049-TA2646 bp
ProteinDPOGS214049-PA881 aa
Genomic positionDPSCF300171 - 378232-398796
RNAseq coverage1478x (Rank: top 9%)
Annotation
HeliconiusHMEL0049780.087.74% 
BombyxBGIBMGA010382-TA0.078.34% 
Drosophiladlg1-PM0.054.63% 
EBI UniRef50UniRef50_P310070.057.17%Disks large 1 tumor suppressor protein n=34 Tax=Eumetazoa RepID=DLG1_DROME
NCBI RefSeqNP_001096956.10.060.75%discs large 1, isoform L [Drosophila melanogaster]
NCBI nr blastpgi|3838490850.063.85%PREDICTED: disks large 1 tumor suppressor protein-like [Megachile rotundata]
NCBI nr blastxgi|3838490850.063.49%PREDICTED: disks large 1 tumor suppressor protein-like [Megachile rotundata]
Group
Gene OntologyGO:00055158.8e-76protein binding
KEGG pathway 
InterPro domain[690-869] IPR0081458.8e-76Guanylate kinase/L-type calcium channel
[691-867] IPR0081444.2e-62Guanylate kinase
[1-63] IPR0151435e-35L27-1
[491-718] IPR0014529.6e-32Src homology-3 domain
[374-499] IPR0014785.5e-31PDZ/DHR/GLGF
[7-67] IPR0041723.6e-10L27
Orthology groupMCL10371 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214049-TA
ATGCCAGTGCGAAAGCAAGAGGCCCATCGAGCGCTGGAACTGCTGGAAGATTATCACAGCAAGTTAGTGAAGCCGCAAGACAGACAGCTGAGACTAGCTATTGAAAGGGTTATAAGGATATTCAAATCAAGGCTCTTTCAGGCCCTACTTGATATTCAAGAATTCTACGAGCTAACTCTCCTTGACGACAACAAGTCCATTCAGCAGAAAACAGCTGAGACGATCCGAATCGCTAACAAGTGGGAGGCTGACGGAGTACGACGGGAATCACAGACTGGAGACAATGACTTTAGATCAGTATCGAATGCCTTCCTTCAAGACACCAACAAGAAATCCGAGAATCGAAATGCGGAAACTCCAGATTCGGCAAGGATGGGAACAATTCCTTCGATTATATCACCAACCGGAGACACAGAGGAAAAAAAGGCACACGTTTCGTCACCTGAAATGAAACAGATAAATGGGATAGAGGGTGGTGCTGAGCGTACGGTGGGCGACGATGGTCATTTGTACCTTACTGTAGTATTATCACGCGCCGGTGGAGCTGGTTTGGGATTCAGTATCGCTGGGGGTTCCGACAACCCCCATATAGCGGATGACCCACTCATCTACGTCACCAAGCTCATTCCAGGCGGGGCCGCCGCTGCCAGCCAACTTCAGATCAACGACGCCATATTACAGGTTAACGATACATCAGTCGAGAACGTGACGCACGCCGAGGCGGTTGATGCGTTGAAAAAGGCCGGCAGTAGTGTTAAATTGAAAATAAGACGTCGACAAGTTGAAGACACTCTGAACGTGTCCACGACATCAAACAGAGAAGAAGCAGTAGAGATTGAACTGGTTAAAGGCGGTTCAGGCTTAGGGTTCAGCATAGCAGGTGGCCTGGGAAACCAACATATACCTGGTGATAATGGAATTTATGTGACCAAGATAATGGCTGGCGGAGCTGCCCACAGAGATGGACGTTTGAGGGTCGGAGATAAATTACTCATGGTCAAAAACACTTCTAAAGGCGATGTGAATTTGGACAATGTGACTCACGAAGATGCTGTGAGCGCTTTAAAGGCTTCAGGGGAACGGGTGCAGTTAGTACTTATACCGGGGCCCAGACACGGCCAGCCCTCGCCCAGAACATCACGAGCAAACACCCCCTCAAGCACCGCTAATTCCCTGCGAAGGGAAGACGTAGTCGACGGAGAGGAGCCGCGAGTAGTGGAATTAGAGAAGGGCCCGCAAGGCCTGGGTTTTAACATCGTGGGCGGCGAGGACGGCCACGGTATATATGTATCGTTTCTTCTCGCCGGCGGACCTGCCGAGAGGTCGGGCCAGCTCCGGCGTGGTGATCGCCTGTTGGCCGTCAACGATGAAAACATCACATCCGCCACACACGAACAGGCGGCTAAGGCTCTCAAGAGCACAGGACAGAACGTCAAACTAACGGTCGTGTACAGGCCACAGGAATATAATAAATTTGAAGCCAGGATCAACGAATTGAAACAACATCACACACTGCTGAGAACCTCCCAAAAGCGATCGCTGTATGTAAGGGCCTTATTTGACTACGATCCTGTAAGAGATGACGGTCTACCATCACGAGGACTTCCTTTCCGCTACGGCGACATCCTTCACGTCACCAACGCCTCTGACGATGAATGGTGGCAGGCGAGACGCCTGGACTCTTCTGATGCGGACGGTGTAGGTATCATACCTTCTAAACGACGCTGGGAAAGGAAACAACGAGCAAGAGATAGACAAGTCAAGTTCCAAGGACAAGGAACACCCGTCAGCACAAGCCAGTCGACCTTAGAGAGGAAAAAGAAAACTCTATCGTTCAGCAGGAAGTTCCCCTTTATGAAGAGTCGAGAAGACGGCAAGTCCGAAGACGGCTCGGACCAGGAGCCTTTCATGCTTTGCTACACCCAGGAGGACGCTAACGCGGACGGGGAAATCTTGTACCGAGTGGAACTTCCCATGATGGAGGAGATAACGCTCATATATCTCGAGGACGATTGTCAAGACGAGGCGGTGTTGTCGTATGAGACGGTCCAGCAATTGACCATAAATTATACAAGGCCCGTGATAATACTTGGACCTCTGAAGGATAGGATCAACGATGACCTTATATCCGAGTTTCCTGACAAGTTTGGAAGCTGTGTGCCTCATACAACTCGTCCACGCCGTGATTACGAGGTCGACGGTCGGGACTACCACTTCGTGTCGAGTCGGGAACAGATGGAGAGAGATATACAGAACCACTTGTTTATAGAGGCGGGTCAGTACAACGAGAATCTGTACGGCACATCGGTCGCCTCAGTCAGGGAAGTCGCCGAGAAGGGAAAGCATTGCATTTTGGATGTCAGCGGAAACGCCATTAAGAGGTTGCAAGTAGCCCAACTATATCCCATTGCCATATTTATTAAACCAAAGAGTGTAGAATCAATTTTGGAAATGACGAAGCGAATGACTGAAGAACAAGCGAAGAAAACTTACGAACGCGCTCTTAAAATGGAGCAAGAATTTGCAGAATATTTCACAGCGGTGGTGACCGGCGATACTCCTGAAGAGATCTACGCGCGTGTGAAGGCGGTCATCACGGCTGAGAGCGGGCCGAGCGTGTGGGTGCCGCGTCGGGAGCCGCTCTGA

Protein sequence:

>DPOGS214049-PA
MPVRKQEAHRALELLEDYHSKLVKPQDRQLRLAIERVIRIFKSRLFQALLDIQEFYELTLLDDNKSIQQKTAETIRIANKWEADGVRRESQTGDNDFRSVSNAFLQDTNKKSENRNAETPDSARMGTIPSIISPTGDTEEKKAHVSSPEMKQINGIEGGAERTVGDDGHLYLTVVLSRAGGAGLGFSIAGGSDNPHIADDPLIYVTKLIPGGAAAASQLQINDAILQVNDTSVENVTHAEAVDALKKAGSSVKLKIRRRQVEDTLNVSTTSNREEAVEIELVKGGSGLGFSIAGGLGNQHIPGDNGIYVTKIMAGGAAHRDGRLRVGDKLLMVKNTSKGDVNLDNVTHEDAVSALKASGERVQLVLIPGPRHGQPSPRTSRANTPSSTANSLRREDVVDGEEPRVVELEKGPQGLGFNIVGGEDGHGIYVSFLLAGGPAERSGQLRRGDRLLAVNDENITSATHEQAAKALKSTGQNVKLTVVYRPQEYNKFEARINELKQHHTLLRTSQKRSLYVRALFDYDPVRDDGLPSRGLPFRYGDILHVTNASDDEWWQARRLDSSDADGVGIIPSKRRWERKQRARDRQVKFQGQGTPVSTSQSTLERKKKTLSFSRKFPFMKSREDGKSEDGSDQEPFMLCYTQEDANADGEILYRVELPMMEEITLIYLEDDCQDEAVLSYETVQQLTINYTRPVIILGPLKDRINDDLISEFPDKFGSCVPHTTRPRRDYEVDGRDYHFVSSREQMERDIQNHLFIEAGQYNENLYGTSVASVREVAEKGKHCILDVSGNAIKRLQVAQLYPIAIFIKPKSVESILEMTKRMTEEQAKKTYERALKMEQEFAEYFTAVVTGDTPEEIYARVKAVITAESGPSVWVPRREPL-