Monarch geneset OGS2.0

DPOGS202075
TranscriptDPOGS202075-TA1380 bp
ProteinDPOGS202075-PA459 aa
Genomic positionDPSCF300053 + 1278051-1295803
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0128240.076.85% 
BombyxBGIBMGA002451-TA3e-16068.88% 
Drosophiladlg1-PM2e-3444.39% 
EBI UniRef50UniRef50_Q6R0053e-4840.17%Disks large homolog 4 n=22 Tax=Chordata RepID=DLG4_DANRE
NCBI RefSeqXP_001815997.17e-4840.54%PREDICTED: similar to discs large 1 CG1725-PK [Tribolium castaneum]
NCBI nr blastpgi|3330337592e-5441.57%discs large 1 [Gryllus bimaculatus]
NCBI nr blastxgi|1892343871e-5340.54%PREDICTED: similar to discs large 1 CG1725-PK [Tribolium castaneum]
Group
Gene OntologyGO:00055154e-25protein binding
KEGG pathwayxtr:7339372e-51 
 K12076 (DLG1)maps-> T cell receptor signaling pathway
InterPro domain[103-224] IPR0014784e-25PDZ/DHR/GLGF
Orthology groupMCL26513 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202075-TA
ATGAGCAGGTGGATGGCGCTTTTCGGGTTAGTGTGGTTGCTGCGAGTTCGTGGATCACTCTGCGGGTGTACTGCTTGCAAAAGAGCAAGACTGGATACTGACGCAAGATGTAGGACATCGGGTGTACAATATCCAGGTGAACAGGTACCTGGGATTGAACAAACATCGCTGGACGAACGAGCGCCCTTTGATGTTATCGCTCTGCAACAGTATACCCGTGATGTCACAGTTCAAACAGATTTCGATGATGAAATTGAGGACTTGGAAAAAATTGATATTAAAATTAACGAAGAGGCAAACGTTAGTAACTACGAAAGCGTTTGTGAACAGTCTACTAGTCCGGAAAACGGGCAAAAGATCACAGGAAGCTATCAGTACTTAATGCATCACAGTACGTCAAATCCGGATGATGTATGGGAAACGACGGATGTGACGCTAGAGAGGGGGGCAAGTGGTCTAGGGTTGAGCATAGCCGGCAGCGAAAGTGGAGGTGACATCAGCATAACTAGAATAGCACCCAACGGCGCTGCCAAGGCAGACGGCCGGCTTCAGACAGGCGACATTCTACTACAGGTCAATGATATTTCTGTGGAAGGTGCACCGCATTCAGTAGCAGTAGACGCATTACAAAAAGCTGGAAATGTCGTCAGATTAAGAGTTCGAAGAGCAAGAAGACCAATGGTTGTGACACTCTGCCGGGGTGCTCGGGGTCTCGGTTTAGGTATAGCTGGAGGGGCTGACGACGCAGGGGGCGAGGGAATTTTTATATCACACATAGCTGTGGGCGGCGCCGCTCACCATGACGGCAGACTGAAATTAGGCGATAAAATATTAGCTGTAAGAGATGAACATGGCATGGAAACATCCCTTATAGGAGTAACCCACGCGTGTGCTGTCTCAGCTTTACGTAGTACAGGAGACCATGTCACCCTGGTAGTATTACCAGCAGCAGGATCGATTCCACCCGTCGCCAGAACCACACCTCTTTATTCCATGCGTACTGGTTCGAGACTAGGAATGGATATAGTTGGTGGTTTGGGCGGAGAAATAGATGGAAACTGCGTTGAAGATGACACTTGTGGCGTATTTGTATCTGGTGTGTCAACGGAAGGAGCAGCTTACGGATTATTGCACAGAGGTGATAGAATACTCAGTGTTAATGGCCGTGATCTGACTCGAGCGACGCACGAACAAGCAGCAGCAGCGTTAAAGCAGTATTCGGGGAGCGCCGTTACTATCGCAGCTCAGTATCAGCCGGAACAATACGAGAGGCTGCGCGCACGCATACGAGCAATTAACGCCACGGCAATGTCGCCACATTCCTACACACGCACTCATATACCTGTGAGGCCGGATGTCCATGCTGCTTACCCAAGGTAG

Protein sequence:

>DPOGS202075-PA
MSRWMALFGLVWLLRVRGSLCGCTACKRARLDTDARCRTSGVQYPGEQVPGIEQTSLDERAPFDVIALQQYTRDVTVQTDFDDEIEDLEKIDIKINEEANVSNYESVCEQSTSPENGQKITGSYQYLMHHSTSNPDDVWETTDVTLERGASGLGLSIAGSESGGDISITRIAPNGAAKADGRLQTGDILLQVNDISVEGAPHSVAVDALQKAGNVVRLRVRRARRPMVVTLCRGARGLGLGIAGGADDAGGEGIFISHIAVGGAAHHDGRLKLGDKILAVRDEHGMETSLIGVTHACAVSALRSTGDHVTLVVLPAAGSIPPVARTTPLYSMRTGSRLGMDIVGGLGGEIDGNCVEDDTCGVFVSGVSTEGAAYGLLHRGDRILSVNGRDLTRATHEQAAAALKQYSGSAVTIAAQYQPEQYERLRARIRAINATAMSPHSYTRTHIPVRPDVHAAYPR-