Monarch geneset OGS2.0

DPOGS203696
Transcript          DPOGS203696-TA    2289 bp
Protein             DPOGS203696-PA    762 aa
Genomic position    DPSCF300010 - 1734746-1742775
RNAseq coverage     641x (Rank: top 20%)
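
The transcript and protein lengths listed for this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the transcript length covers the coding sequence including the stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record itself.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp % 3 == 0 and transcript_bp // 3 == protein_aa + 1


def genomic_span(start: int, end: int) -> int:
    """Length of the genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record for DPOGS203696.
consistent = cds_matches_protein(2289, 762)
span = genomic_span(1734746, 1742775)
print(consistent, span)
```

Under these assumptions the genomic interval is longer than the transcript, which would be consistent with the presence of introns, although the record does not list the exon structure.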
Annotation
Heliconius        HMEL013305          8e-51     54.30%
Bombyx            BGIBMGA003488-TA    0.0       82.78%
Drosophila        Patj-PA             2e-109    50.87%
EBI UniRef50      UniRef50_D6W6M5     0.0       52.79%    Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W6M5_TRICA
NCBI RefSeq       XP_001813830.1      0.0       52.79%    PREDICTED: similar to GA11344-PA, partial [Tribolium castaneum]
NCBI nr blastp    gi|270014414        0.0       52.79%    hypothetical protein TcasGA2_TC001640 [Tribolium castaneum]
NCBI nr blastx    gi|270014414        0.0       52.38%    hypothetical protein TcasGA2_TC001640 [Tribolium castaneum]
Group
Gene Ontology      GO:0005515    1.3e-26    protein binding
KEGG pathway       bta:536863    2e-120
                   K06095 (MPDZ, MUPP1) maps-> Tight junction
InterPro domain    [290-421] IPR001478    1.3e-26    PDZ/DHR/GLGF
Orthology group    MCL10605    Multiple-copy universal gene
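
The InterPro entry gives the PDZ/DHR/GLGF domain as the interval [290-421]. A minimal sketch of how that interval could be cut out of the protein sequence follows. It assumes the coordinates are 1-based, inclusive amino-acid positions on DPOGS203696-PA, which the record does not state explicitly, and the helper names are hypothetical.

```python
def slice_domain(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based inclusive."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError("domain coordinates fall outside the protein")
    return protein[start - 1:end]


def domain_length(start: int, end: int) -> int:
    """Number of residues in a 1-based inclusive interval."""
    return end - start + 1


# Toy sequence for illustration; pass the DPOGS203696-PA sequence in practice.
example = slice_domain("ABCDEFG", 2, 4)
pdz_length = domain_length(290, 421)
print(example, pdz_length)
```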

Nucleotide sequence:

>DPOGS203696-TA
ATGCATCTGGGCGTAAATATATCAAATGCTCTCCAACAGTTAGAAAGTGTAAAGACAGCTGTAGATCAAAGTGATGATCCTAAACTAAAGGCAGCTACAAATGATGACCTTAACCTTCTGATTAATCTCTTAGAAAGTCCAATTTTAAAAAGCATTGCCACCTTACATGATTCTGTAGGAATGCTTGCCACACAGGTTGCACATCATCCATCTATACTTCCTGAAGATTTTGATATAACACCAGCTGGAGATTTAGCATTGCAAGCAAGGAACCTATATGGAGCTCATGAAGGTGAAGAAGAACAGAGAGTGCCTCAAGTGTCTCCACCTCACAGTCTCGAGTTTGGATCTGGATCTGACACAGAACGAATTCTTGGCCTCGATAATGCTTCTGTTGATTCCAATATGTCCCCTAAGCAGAACCGTGGATTACTAGAAATGAATAGAAATGATGACTCATTGGTCTCAACGTCTACTAACATTGTAGCTGGTGACTGGGCCCAAGTTGAAATAATTAATTTAGTGAATGATGGAACTGGACTCGGATTTGGCATTATAGGAGCCAGAACAAGTGGTGTTATTGTGAAGACAATACTGGCGGGAGGTGTTGCAGACAGAGATGGCAGATTGAAATCGGGAGATCATATTTTACAAATTGGTGATGTCAGTCTGATGGAGATGGGTTCAGAGCAAGTGGCGGGTGTCCTTCGTGCTTCGGGGTCCCGAGTGCGACTGGTGGTGGCTCGTGCTGTGGACCCCGCAGACCCAGCTACCCTAAATGCTTCAGCTGCTCCACTAGTGCCCGCTAGATCTGAGGAATCAAGCCCTGAAGCGTTAGATCGTTACTTGATGGAAGCTGGTTTTGAACAAGTATTCCACACTCAGCCGACTCCGATAGCCCTGAACACTAATGTGTCGAGATTTGTTTTCGATAGTTCAATTACCCCTCCTCCCGACTCAGATGCAGAATCTCCTGAAGTAGACAGATTCACGGTGCAGTTAAAGAAAGATGAAAACGGTCTTGGGATTACAATCGCAGGATATGTCTGTGAAAAGGAGGAGTTGTCCGGTATTTTCGTGAAGTCAGTGACCCCTGGCAGCGCTGCCGCACTGAGCGGTAAAGTGAGGGTCAACGATCGTATAATAGCTGTTGATGGAGTGTCCCTCGCTGGAAAGTCCAACCAGAGAGCGGTAGACGCTCTCAAACAGAGCGGAAATGTTGTTACTCTGGAACTTGAAAGATATCTCCGCGGTCCTAAGTTCGAGCAACTGCAGCAGGCGATAGCTGCCGGCGAGTCGTCGACGGCGCCGCCCGCTCACAACCCCTTCCTACACATGCACCGACCTGAACATTCGGACGACATGACGTCACAGACCACACACCTCCCACCGCCAGTGGAGATGCCGGTAGAAGAACCCGAGACTCCGTCGGTCCCCGCATTCGAGATGCCTCGCACGCAGGAACAGAAAGACGCCATAAAGAGGAAATGGCAGTCTATACTTGGTGATGACGTGGAGATTGTTGTGGGTGTGGTGGTACGTGGTGGTGGCGGTCTGGGGATATCTCTGGAGGGCACTGTGGACGTGGAGGGAGGGAGAGAGGTTAGACCACATCACTACGTACGGTCCGTACTACCGGAAGGACCGGTCGGGAGGGCCGGGGTTCATCGACCCGGGGACGAATTACTCGAGGTGAACGGTCACCGTCTGCTGGGTATGAACCACCTGGAGGTGGTGTCGATCCTGAAAGAACTGTCCAGCGAGGTGTGCATGGTGTGTGCGAGACCGAGACCAGCGCCGCCGCTAGACCTCGCGCCGCCCGCGGCTACACTCGTTAAGGCGAAGTCGGATGGCAGTCTAGCGGGCGCGGGGGCGGAGGACGGGGGGTCATTGACCGCAGGGGGAAAGGTTCGATCAAGGAGTCTCGAACCACTCACAGGACTAGCGATGTGGGCGTCCGAGCCACAGATTATTGAGCTGGTGAAGGGCGAACGAGGTTT
GGGCTTCTCCATCCTTGACTACCAGGATCCCCTGCGTCCCTCGCACACCCTGGTGGTGATACGGTCCCTGGTGCCGGGCGGCGTGGCCCAGCAGGACGGCAGGCTCATACCAGGCGACAGGCTCCTCTTCGTCAATGAACAGAACTTGGAGAACGCTAGTCTAGAACAAGCCGTCGCTGCTTTAAAGGGCGCCCCCCGTGGCGTGGTCCGTATTGGCGTGGCGAAGCCTCTACCACTAACCGACGGTCCCCCACCACTACCGGCTACCTCCCCACCACCACACCACTAG

Protein sequence:

>DPOGS203696-PA
MHLGVNISNALQQLESVKTAVDQSDDPKLKAATNDDLNLLINLLESPILKSIATLHDSVGMLATQVAHHPSILPEDFDITPAGDLALQARNLYGAHEGEEEQRVPQVSPPHSLEFGSGSDTERILGLDNASVDSNMSPKQNRGLLEMNRNDDSLVSTSTNIVAGDWAQVEIINLVNDGTGLGFGIIGARTSGVIVKTILAGGVADRDGRLKSGDHILQIGDVSLMEMGSEQVAGVLRASGSRVRLVVARAVDPADPATLNASAAPLVPARSEESSPEALDRYLMEAGFEQVFHTQPTPIALNTNVSRFVFDSSITPPPDSDAESPEVDRFTVQLKKDENGLGITIAGYVCEKEELSGIFVKSVTPGSAAALSGKVRVNDRIIAVDGVSLAGKSNQRAVDALKQSGNVVTLELERYLRGPKFEQLQQAIAAGESSTAPPAHNPFLHMHRPEHSDDMTSQTTHLPPPVEMPVEEPETPSVPAFEMPRTQEQKDAIKRKWQSILGDDVEIVVGVVVRGGGGLGISLEGTVDVEGGREVRPHHYVRSVLPEGPVGRAGVHRPGDELLEVNGHRLLGMNHLEVVSILKELSSEVCMVCARPRPAPPLDLAPPAATLVKAKSDGSLAGAGAEDGGSLTAGGKVRSRSLEPLTGLAMWASEPQIIELVKGERGLGFSILDYQDPLRPSHTLVVIRSLVPGGVAQQDGRLIPGDRLLFVNEQNLENASLEQAVAALKGAPRGVVRIGVAKPLPLTDGPPPLPATSPPPHH-
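
The nucleotide and protein records above can be checked against each other by translating the transcript. The following is a sketch rather than the pipeline used to build the geneset: it assumes the standard genetic code, a coding sequence that starts at the first base, and `-` as the stop symbol (matching the last character of the protein record). The function names are hypothetical. The parser joins wrapped sequence lines, which matters here because the nucleotide record spans more than one line.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(text: str) -> dict:
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# The first and last codons of DPOGS203696-TA, copied from the record above.
print(translate("ATGCATCTGGGCGTA"))  # first five codons
print(translate("CACCACTAG"))        # last three codons, ending in the stop
```

To check the whole record, pass the full text of both FASTA blocks through `read_fasta` and compare `translate(records["DPOGS203696-TA"])` with `records["DPOGS203696-PA"]`.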