Monarch geneset OGS2.0

DPOGS205600
TranscriptDPOGS205600-TA1704 bp
ProteinDPOGS205600-PA567 aa
Genomic positionDPSCF300167 - 129998-137262
RNAseq coverage678x (Rank: top 19%)
Annotation
HeliconiusHMEL0135632e-12463.51% 
BombyxBGIBMGA007148-TA2e-16771.19% 
DrosophilaCG32758-PA2e-17863.90% 
EBI UniRef50UniRef50_Q9W4863e-17663.90%CG32758 n=19 Tax=Endopterygota RepID=Q9W486_DROME
NCBI RefSeqXP_967060.10.067.22%PREDICTED: similar to sorting nexin [Tribolium castaneum]
NCBI nr blastpgi|910781720.067.22%PREDICTED: similar to sorting nexin [Tribolium castaneum]
NCBI nr blastxgi|910781720.067.22%PREDICTED: similar to sorting nexin [Tribolium castaneum]
Group
Gene OntologyGO:00055153.3e-26protein binding
GO:00071543.3e-26cell communication
GO:00350913.3e-26phosphatidylinositol binding
GO:00071651.1e-10signal transduction
KEGG pathway 
InterPro domain[154-269] IPR0016833.3e-26Phox homologous domain
[8-139] IPR0014785.4e-24PDZ/DHR/GLGF
[277-360] IPR0001591.1e-10Ras-association
Orthology groupMCL13020 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205600-TA
ATGGCTGAGACAGAAACCGATAATAACAGTATTGTAAGACCTGAAAGAAATAACAAAAGTATAGTGTCTAATAATGGGCCCCGCCGCGTTACCATTTATAAAACTGAGACTGGTTTTGGTTTTAACGTCCGCGGCCAAGTTTCTGAGGGCGGACAGCTGAGATCGATTAATGGGGAACTTTATGCCCCTCTGCAGCATGTCAGTGCTGTATTGGAGCAGGGAGCGGCTGAGCAGGCCGGTATTAGAAAAGGAGATAGAATACTCGAGGTTAACGGTGTGAATGTAGAAGGCTCCACTCACAAGCAAGTCGTTGATCTGATCAAGTCTGGAGGCGACTGTCTTACGCTCACCGTCATCTCCGTAACACCGAAGGAAGCTGAGCGCTTAGAGCCTTCAGACGATTCAGCGCCGGTAGGTGTCATAGGAGGTGCCGCTAACTCCAGGGCTACGGTCTACCAAAGGTATGACTACACGGACAAGAGGTCGTTACCCGTCTCCATACCCGACTACCGAGTGGTGATGGAGCGGAACACCGGCAGATCCTACGTAGCATTCAATGTACACATGGCCGGCAGACATCTCTGTAGCAGACGCTATAGAGAATTTGCAGCGCTCCATCAACAACTTAGGAAGGAGTTTCTTGGTTTCAGCTTCCCCAAACTCCCCGGCAAGTGGCCCTTCACACTCAGCGAACAACAACTGGATGGCAGGAGACGCGGCCTGGAACAGTACTTAGAGAAGGTGTGTGCAATCCGTGTGATAGCTGAGAGTGACGCCGTGCAAGAGTTCCTCACCGACTGCGACGACTCTTGCAACCCCTCGCCCGTGGAGCTAAAGGTGCTTCTGCCGGACAAGGAGGTGGTCACCGTGGCCGTGCTCAAATCCACCAACGCCGACGACGTCTACAAGGCCGTCTGCGACAAGATAGGTCTAGCCAGGAACATACAGAACTACTTCTATCTCTTTGAAATCGTTGAATACAATTTTGAGCGTAAACTTCAACCGAACGAGTGTCCCCACTCGCTGTACATCCAGAACTACTCCACCGCGTCCTCGTCGTGTCTGTGTGTCCGCAAGTGGCTCTTCAGACCCGACACCGAGCTCGACCTGCTGAGAGACGACACCGCCGCCGCTTTCATATTTTGGCAGGCGGTGGAAGATGTGAACCGCGGCGTGTGCTCGGCCGGAGCCCGCCTCTATCAGCTGAAGGCTCTACAGGACGTGCGTCGCGCTCGGGACTACCTGGCGCTGGCGAGGACGTTACCCGGATACGGAGACGTCGCCTTCCCGCCCGCCCGCACCGACTGCCGCGCCGCGCCCGCCCTCGCCATCTCAGTCGGTTGGGAAGGTATAAAGCTGTCGTCGTGGAGCGAGTCGTCCGCGTGCGAGGGCGGCGTAGTGAACGTGCCCTGGACGGATGTGAGGGAGTGGCGCGCGGACGACGACGCGGCCGCCGCCCGTCTCGTCTACAGGAGGAGGGACAGGCCGCCCAGGACTATAGCACTGCATTCGCCATATGTCAGTATATACGGACAAACAAACACACACACACATGGTCTCATACATTTTATACGTTATCAAAATTATAATATGCGAGATCAAAATCAGTATATATGGCTATACACACACACTAACGCACATGCGACACAGACGCACCATGTAGGAGGGGAAAGAGTAACTAGTGATATAGTACGAAATATATAA

Protein sequence:

>DPOGS205600-PA
MAETETDNNSIVRPERNNKSIVSNNGPRRVTIYKTETGFGFNVRGQVSEGGQLRSINGELYAPLQHVSAVLEQGAAEQAGIRKGDRILEVNGVNVEGSTHKQVVDLIKSGGDCLTLTVISVTPKEAERLEPSDDSAPVGVIGGAANSRATVYQRYDYTDKRSLPVSIPDYRVVMERNTGRSYVAFNVHMAGRHLCSRRYREFAALHQQLRKEFLGFSFPKLPGKWPFTLSEQQLDGRRRGLEQYLEKVCAIRVIAESDAVQEFLTDCDDSCNPSPVELKVLLPDKEVVTVAVLKSTNADDVYKAVCDKIGLARNIQNYFYLFEIVEYNFERKLQPNECPHSLYIQNYSTASSSCLCVRKWLFRPDTELDLLRDDTAAAFIFWQAVEDVNRGVCSAGARLYQLKALQDVRRARDYLALARTLPGYGDVAFPPARTDCRAAPALAISVGWEGIKLSSWSESSACEGGVVNVPWTDVREWRADDDAAAARLVYRRRDRPPRTIALHSPYVSIYGQTNTHTHGLIHFIRYQNYNMRDQNQYIWLYTHTNAHATQTHHVGGERVTSDIVRNI-