Monarch geneset OGS2.0

DPOGS207563
Transcript        DPOGS207563-TA  4587 bp
Protein           DPOGS207563-PA  1528 aa
Genomic position  DPSCF300072 - 703595-722328
RNAseq coverage   81x (Rank: top 64%)
Annotation
Heliconius       HMEL017147        0.0     69.52%
Bombyx           BGIBMGA004713-TA  0.0     71.91%
Drosophila       RhoGAP93B-PA      5e-172  67.32%
EBI UniRef50     UniRef50_Q7PQD3   3e-171  69.04%  AGAP004524-PA n=1 Tax=Anopheles gambiae RepID=Q7PQD3_ANOGA
NCBI RefSeq      XP_001657574.1    0.0     53.03%  hypothetical protein AaeL_AAEL006191 [Aedes aegypti]
NCBI nr blastp   gi|157112578      0.0     53.03%  hypothetical protein AaeL_AAEL006191 [Aedes aegypti]
NCBI nr blastx   gi|157112578      0.0     45.05%  hypothetical protein AaeL_AAEL006191 [Aedes aegypti]
Group
Gene Ontology    GO:0007165  1.6e-39  signal transduction
                 GO:0005622  1.6e-39  intracellular
                 GO:0005856  2.2e-20  cytoskeleton
                 GO:0005515  4.2e-07  protein binding
KEGG pathway     bfo:BRAFLDRAFT_66486  9e-11
                 K07526 (SRGAP) maps-> Axon guidance
InterPro domain  [1329-1523]  IPR008936  1.6e-39  Rho GTPase activation protein
                 [1327-1520]  IPR000198  1.5e-33  Rho GTPase-activating protein domain
                 [1199-1318]  IPR000857  2.2e-20  MyTH4 domain
                 [547-592]    IPR001202  4.2e-07  WW/Rsp5/WWP
Orthology group  MCL13356  Single-copy universal gene

Nucleotide sequence:

>DPOGS207563-TA
ATGTTTTATTCCAGAGTGGAATGGGTGGAGATCATAGAACCGAAGACAAAGGAGCATATGTATGCTAACCTAACAACTGGTGAATGTGTATGGGATCCTCCTGAGGGTGTGAAAGTGAAACGCACAGACTCTTCTCAATGGTGGGAATTATTTGACATCAATACTCATAGGTTTTACTATTATAATGCATCTACTCAGACCACTGTTTGGCATAGACCAACAGATTGTGATATTATACCACTAGCGAAATTACAAACATTGAAACAGAATACCGATCCTCAAAAGAGAAGAGAGACATCAACACAAACAAATCAAGCAGTGCCGCAGGTGAATATTCAAATCCTTTTTGCCTTGACCACTTTGTTCAACTTTAGCAGTGGTGAACGTCGCTGTCGTCTGAGCGAGGACAGCGCTGCGCTCTGTCGCCACACAGATTCTGGTCGTTCCTCTGACAGTAGCCTTAGTAGTGCCCAGCAGAGGAGGAAACAGCAAGAGTTAAGTGCGTGTGGCGCGGCTTGTACTCCTCAGCTGCGCAAGCGCAGTTCGGCCCAGCCTCACGAACCAGACAAGTGGTCCCCTCACACCGCTCTCAATCGTAGCGGAAGTTTTATTTCACCACGAAAAGATGCTTCAGACGACTCGATGCACGAGAAGTACTTTAAATCTGTCGAGAGCACGCCGCTAACGAGGCGTAAGGCGAGCGCGGCTCGTCCGTCACTAGTGTCGTCTATTTCGTTGGCTGAGAACACTCGCGCCATGCACTCGCTGGAGAGACCTCACCTGTCACGACACACTAAGGACCGCTACTCCGACAGGCATTTAGACAAAGAGCGTCTAGCCGAGCTTGAGCGAATGGAGCGGTATCCTGAGAAGCAGGCTGTATACAGTGTGGACAAACCCTTCAGACAAACCAGCCTGGACCTACGTCACAACGCCTCCAGCTGGCATCCTCCGCGAGAATTTACTAGACGTCAACAAGCCGAACCCGAGCGTTATACCTCCAGCAAGACGTCAGTGTCCTCTGGCAGTAGCAAACAAAGGTCCAACGAGCAACATAACTTCATGAGGCTGTCTTCCGCCAACGAACAGGACAATATAGCTGCTGTGAATTATAAAATGAAATGCGTCAGCTTGGAGCCGACGTTGACAGAGCCTAATGTACAGAAACACAATCAACGTCTAGAGAAAACAACGACGCCCAGGGATAGAACGAAATCACGACGTCACAAGCGTGTGGAGGAGGAAGCGGCCTTGGAGGGTGACGAGGAGGACGAGGACGGGTCGCCGCTGTACTGTAACTGGGACCTTAGAGGCATGCAGCACTTACTCCCTCTACAGGACTACATAATACAGCAGGCTAAACTATCCAGTCGCGGTCGTTATGCGGGTTCGGATTCTGACGGTGAGTCCGGGAGCTCGAGGTCGCGGTCGGGTTCGGAGTCCCGGTCTCTATCCGGACACGAGCCTGACAACGAGTCCTCCGACGGCGGCGCGCGGACCCCTGACGACGATGACGGCTCAGGGGTCTACCTCCCGCACACGTACGGAACACACAGACCGACCGACGCCGAATATATGAACCACGGGGTCGCCTATTATAATACGTTCGTTGGATTCGATCGACAACCCTCGCCTGGTATACACAGGGGTGTGAAAGTGAAACGCACAGACTCTTCTCAATGGTGGGAATTATTTGACATCAATACTCATAGGTTTTACTATTATAATGCATCTACTCAGACCACTGTTTGGCATAGACCAACAGATTGTGATATTATACCACTAGCGAAATTACAAACATTGAAACAGAATACCGATCCTCAAAAGAGAAGAGAGACATCAACACAAACAAATCAAGCAGTGCCGCAGGTTCCATCGACACTAGATCCAATCAGTAGCAATGGCAACAGCACGGCAGCAAGTGCTAATAAACTGTCACCAAATAACACAAAGAGTGTTACCATCACACAGACCAGCCCCGTTAGAACAAGAAGATCTAA
TTATCATCAACGTAGTGCCCAGCAGAGGAGAAAACAGCAAGAGTTAAGTGCGTGTGGCGCGGCTTGTACTCCTCAGCTGCGCAAGCGCAGTTCGGCCCAGCCTCACGAACCAGACAAGTGGTCCCCTCACACCGCTCTCAATCGTAGCGGAAGTTTTATTTCACCACGAAAAGATGCTTCAGACGACTCGATGCACGAGAAGTACTTTAAATCTGTCGAGAGCACGCCGCTAACGAGGCGTAAGGCGAGCGCGGCTCGTCCGTCACTAGTGTCGTCTATTTCGTTGGCTGAGAACACTCGCGCCATGCACTCGCTGGAGAGACCTCACCTGTCACGACACACTAAGGACCGCTACTCCGACAGGCATCTAGACAAAGAGCGTCTAGCCGAGCTTGAGCGAATGGAGCGGTATCCTGAGAAGCAGGCTGTATACAGTGTGGACAAACCCTTCAGACAAACCAGCCTGGATCTCCGTCACAACGCCTCCAGCTGGCATCCACCGCGAGAATTTACTAGACGTCAACAAGCCGAACCCGAGCGTTATACCTCCAGCAAGACGTCAGTGTCCTCTGGCAGTAGCAAACAAAGGTCCAACGAGCAACATAACTTCATGAGGCTGTCTTCCGCCAACGAACAGGACAATATAGCTGCTGTAAATTATAAAATGAAATGCGTCAGCTTGGAGCCGACGTTGACAGAGCCTAATGTACAGAAACACAATCAACGTCTAGAGAAAACAACGACGCCCAGGGATAGAACGAAATCACGACGTCACAAGCGTGTGGAGGAGGAAGCGGCCTTGGAGGGTGACGAGGAGGACGAGGACGGGTCGCCGCTGTACTGTAACTGGGACCTTAGAGGCATGCAGCACTTACTCCCTCTACAGGACTACATAATACAGCAGGCTAAACTATCCAGTCGCGGTCGTTATGCGGGTTCGGATTCTGACGGTGAGTCCGGGAGCTCGAGGTCGCGGTCGGGTTCGGAGTCCCGGTCTCTATCCGGACACGAGCCTGACAACGAGTCCTCCGACGGCGGCGCGCGGACTCCTGACGACGATGACGGCTCAGGGGTCTACCTCCCGCACACGTACGGAACACACAGACCGACCGACGCCGAATATATGAACCACGGGGTCGCCTATTATAATACGTTCGTTGGATTCGATCGACAACCCTCGCCTGGTATACACAGGACGTCGTCCTTGGGTGGTTCGGGTGCTCCTGCGGTGTCCTCTGGTACGGGGAACACGAGTGCGGGCGAGTGCAGCCGAGCGGACGCCTTGTATAGTCCCGTGAGGTCGCACCCACATTCGCACTCCCACTCGCATCCACACTCGCACCCACACACCCACCTACACCCGCCGCCCGAACAGGATATAGAGATATTCGCAAAGGACAATCTCAATTTCAACAAAGGAATATTCAGGAGAAAGGCGTCAGTCCGTGACATGCTTTCTTGGACGTCATCATCGATTAGCGCCCCCATGGTGGGAGCGGAATGGGACAAGACTCATAAAAAGGCCGCCATAGATCTGTTCCGACTCGTACAGATATACATGGGAGACCGCAAAGCCCGCCCCGGTATGACGCTCAACTCTGTTGCCCAGGATATACTTCACGCAACCTTCACCAATGAGAAATTGCGTGACGAGCTGTACGTGCAGCTGTGTCGTCAGACTACAGAGAACCCTCTCCGCGACTCGCTGCTCCGCGGTTGGGAGCTGCTGGCTGTGTGTCTCGCGTTCGTGCCGCCCTCGCCGGCCTTCCAGCCAGCACTCACCAACTACGTCAACAGACACCGCGATCCGGCCTTCGCGGATTGTTTCCCTGAGGTTGGTAAATGGCCTATTCACGTCCAGGTGTCACACTATGCGAGCGTGGCATGTAAGAGATTGGAGAGGATCGGCTTCGGAGGAAAAAGGCAACCAAGAAAACCGAGTACTGAAGATATTGATCAAGCGAGAATTCAAATATTCAGGCAGTCGATGTTCGGTAACA
CGCTCGCGGAAGTGATGGTGTTACAAAAAGAGAGATTCCCTCACCGACAGCTACCCTGGGTTCAAGTGGCGCTGTCCCAACAAGTGTTACAACTCAATGGTAGAGAGACTGAGGGGATATTTAGAGTGTCAGCTGACGTGGACGAGGTCAACGCCCTTAAAGCTAAAATAGATAATTGGGAACTGCCAGATGCATCTATGCTTACGGATGCTCATGCGCCAGCCAGTCTACTGAAACTATGGTATCGTGAATTGTATGAACCTTTGATACCTGACTCCCTGTACTCGGCGTGCGTCGCTTCGGGCGGGGATTTCCCGGCCTGCGAGCGAGCCCTACAAAGACTACCGCCGCTGAATAGACTTGTACTTACATACTTAATAAGTTTCCTGCAGCAATTTACGGCACCGGAAGTGGTAAGTCAGACGAAGATGGACTCAGCGAATCTGGCTATGGTGTTCGCTCCAAACTGTCTCCGATGTACGTCACAGGACCCCCGCGTGATCCTGGAGAACGCTAGGAAGGAAATGACCTTTCTGAAAACTCTCATCACAAACCTCGATACATCCCACGTCCAAGATCTCCTGTGA

Protein sequence:

>DPOGS207563-PA
MFYSRVEWVEIIEPKTKEHMYANLTTGECVWDPPEGVKVKRTDSSQWWELFDINTHRFYYYNASTQTTVWHRPTDCDIIPLAKLQTLKQNTDPQKRRETSTQTNQAVPQVNIQILFALTTLFNFSSGERRCRLSEDSAALCRHTDSGRSSDSSLSSAQQRRKQQELSACGAACTPQLRKRSSAQPHEPDKWSPHTALNRSGSFISPRKDASDDSMHEKYFKSVESTPLTRRKASAARPSLVSSISLAENTRAMHSLERPHLSRHTKDRYSDRHLDKERLAELERMERYPEKQAVYSVDKPFRQTSLDLRHNASSWHPPREFTRRQQAEPERYTSSKTSVSSGSSKQRSNEQHNFMRLSSANEQDNIAAVNYKMKCVSLEPTLTEPNVQKHNQRLEKTTTPRDRTKSRRHKRVEEEAALEGDEEDEDGSPLYCNWDLRGMQHLLPLQDYIIQQAKLSSRGRYAGSDSDGESGSSRSRSGSESRSLSGHEPDNESSDGGARTPDDDDGSGVYLPHTYGTHRPTDAEYMNHGVAYYNTFVGFDRQPSPGIHRGVKVKRTDSSQWWELFDINTHRFYYYNASTQTTVWHRPTDCDIIPLAKLQTLKQNTDPQKRRETSTQTNQAVPQVPSTLDPISSNGNSTAASANKLSPNNTKSVTITQTSPVRTRRSNYHQRSAQQRRKQQELSACGAACTPQLRKRSSAQPHEPDKWSPHTALNRSGSFISPRKDASDDSMHEKYFKSVESTPLTRRKASAARPSLVSSISLAENTRAMHSLERPHLSRHTKDRYSDRHLDKERLAELERMERYPEKQAVYSVDKPFRQTSLDLRHNASSWHPPREFTRRQQAEPERYTSSKTSVSSGSSKQRSNEQHNFMRLSSANEQDNIAAVNYKMKCVSLEPTLTEPNVQKHNQRLEKTTTPRDRTKSRRHKRVEEEAALEGDEEDEDGSPLYCNWDLRGMQHLLPLQDYIIQQAKLSSRGRYAGSDSDGESGSSRSRSGSESRSLSGHEPDNESSDGGARTPDDDDGSGVYLPHTYGTHRPTDAEYMNHGVAYYNTFVGFDRQPSPGIHRTSSLGGSGAPAVSSGTGNTSAGECSRADALYSPVRSHPHSHSHSHPHSHPHTHLHPPPEQDIEIFAKDNLNFNKGIFRRKASVRDMLSWTSSSISAPMVGAEWDKTHKKAAIDLFRLVQIYMGDRKARPGMTLNSVAQDILHATFTNEKLRDELYVQLCRQTTENPLRDSLLRGWELLAVCLAFVPPSPAFQPALTNYVNRHRDPAFADCFPEVGKWPIHVQVSHYASVACKRLERIGFGGKRQPRKPSTEDIDQARIQIFRQSMFGNTLAEVMVLQKERFPHRQLPWVQVALSQQVLQLNGRETEGIFRVSADVDEVNALKAKIDNWELPDASMLTDAHAPASLLKLWYRELYEPLIPDSLYSACVASGGDFPACERALQRLPPLNRLVLTYLISFLQQFTAPEVVSQTKMDSANLAMVFAPNCLRCTSQDPRVILENARKEMTFLKTLITNLDTSHVQDLL-