Monarch geneset OGS2.0

DPOGS204642
TranscriptDPOGS204642-TA2160 bp
ProteinDPOGS204642-PA719 aa
Genomic positionDPSCF300277 + 232685-253948
RNAseq coverage578x (Rank: top 22%)
Annotation
HeliconiusHMEL0094542e-10959.50% 
BombyxBGIBMGA009471-TA3e-7859.71% 
DrosophilaGraf-PF4e-13344.41% 
EBI UniRef50UniRef50_UPI00020625880.043.34%UPI0002062588 related cluster n=1 Tax=unknown RepID=UPI0002062588
NCBI RefSeqXP_001122822.10.045.84%PREDICTED: similar to Graf CG8948-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838608150.045.92%PREDICTED: rho GTPase-activating protein 26-like [Megachile rotundata]
NCBI nr blastxgi|3504146890.046.64%PREDICTED: rho GTPase-activating protein 10-like [Bombus impatiens]
Group
Gene OntologyGO:00071651.1e-54signal transduction
GO:00056221.1e-54intracellular
GO:00055155.2e-16protein binding
GO:00057371.1e-11cytoplasm
GO:00468472e-06filopodium assembly
GO:00171242e-06SH3 domain binding
GO:00080932e-06cytoskeletal adaptor activity
KEGG pathwaymmu:785149e-128 
 K13736 (ARHGAP10)maps-> Bacterial invasion of epithelial cells
InterPro domain[369-542] IPR0001981.1e-54Rho GTPase-activating protein domain
[359-550] IPR0089363.4e-45Rho GTPase activation protein
[663-719] IPR0014525.2e-16Src homology-3 domain
[22-225] IPR0041481.1e-11BAR
[37-216] IPR0136062e-06IRSp53/MIM homology domain (IMD)
Orthology groupMCL10369 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204642-TA
ATGGGCGTGGGCCTGCAACCGCTGGAGTTCACCGAGTGCCTGGCAGACAGCCCGCACTTCCGAGAGAACCTACAGCGTCATGAGAAGGAGCTGGAGCGGACCAGCCAGCAGATCAAACGTCTCATCAAGGAGGTCAAGGATGTCGTGCAAGCCGCCAAACGTCTGGGTGCGGCTCAGCTGGCTTTGGCGGCCAGTATGGAGCAGTTCGAGTTCGCGTGCATCGGAGCCTCTATGACTGAAGATGAGAGGGTCATCGGAAGATCCTTGCATCACTTCGCTAACCTCATCAGAACTATAGAGGATGAAAGAGATAGAATGCTGGGTCGAGCACACGAGCAAATTATACAGCCTCTGGAGAAGTTCAGGAAGGAACATATTGGTGCTGTTAAGGAAGGCAAGAAGAAGTTTGACAAAAAGACTGCAAAATTCTGTCAGAGTCAAGAGCGCACTTTGTCGCTATCAACCAAAAAACCAGAAGCTGTCTTCCAAGAAGCCGATGCGGCGATGGATATGGCGGAGCGCGACTTCTGCCAGGCGTCCCTGGAGTACGTGTTCCAGTTGCAGGCCGTCCAGGAGAGGAAGAAGTTCGAGCTGGTCGAGACGCTGCTGGGGTTCGTGTTCGGCTGGTGGACCTTCCATCACACGGCGCATGACGTGCACGCTGACGCCGAGCCGCTCGTCAGAGACCTGCAGCTCCGGATACAGAGGACGAGAAGTAACTTTGAAGAGACCAGCAAACAGACGGAGTCGCTGATGAAAAAGATGATGGAGGTCAGGCAGATGGCGTTCGGTACTACCTGGAGCAAACAGTACTGCACATATGAGAAGATGACGAGCACACTCACACTGATGCCATACAACCAGATTAATGTAAAGACGGCTGGTCCCGTGGAGAGTGTGGTGGTGGTGGGAGCGCGGCCCGTCACGGATGCCGAGAGGAGGTTCTGCTGGGAGGCTCTGGTGGAGGAGAAGCCTCCGCTGGCGCTGCAGGCCGCCGCCGACCGGGAGCGCGCCGCCTGGATCAGGACGCTCAGGCGGGCCGGCGCCCCGCACACGGACACGCCCGCGCCCAGGGCCAGCGACGGCGAGCTCTGGCCGCTCGACGACGCCGGCTTCGAGTTCGTCAGGAGGCTGGCGACTGAGCTTGAGGCCCGGGGGCTCGACGACCAGGGACTGTACCGGGTGGCCGGCGTGTCGTCCAAGGTGTCTCGTCTGGTGTCCCTGGGTCGCTCGGGGCGCTTGCCCCCGTCGCTGGAGTCGTTCGAGTCCCGCACGCTCACCTCCGCCCTCAAGAGCTACCTCCGAGCGCTGCCCGACCCTCTGCTCACGCGACGCCTCCACGACGACTTCCTCGCCGCCGCCAAATGCGAGCGTTCCTCGGAGCGCGTGTCCCGCCTGTACTCGCTGGTGCGCGCGCTGCCGCCCGCCAACCGCGCCATGCTGCAGCTGGTGCTGGCCCACCTGGAGCGCGTGGCGGCCAGGAGTGACGTCAACCTGATGACGTCATCCAACCTCGCCGTGTGCTTCGGGCCGACGCTGTTGAGAGCGGAGCGGGAAACCGTGGCCTCCATACTGGAGCTGAAGTTCTACAACGTGCTGGTGGAGGCGCTGCTCGACAATATATCCGCGGTGTTCGCGCCTCTCCCGCCCGCCGCTGTGCCGCCCGCTGAAAACCACAACGGTATCGCTGGAACATCTCCGTCTTCGATACCTCTCGCATCTCGCAATGATATTAGTGTGTGTGACCGCTCCCTGGTGACGTGTGGGGGTTCAAGTGTGTCAGACGTGGGTGTCTCTGGAGCCGCGGTAGGGAACTACTCCCCGCATCATCACCAACTGTTGCAGCACTTCTCGAACGCACACACACGTGTTGGTCGTTGTAGCAGCAGTTCGGAGTCGGTGTCGAGTCACAGCGCCTCCCCCCCACCCCGCGCCCCGCTCACACACAGCATACACAACCCTTCGCTCGCCTTCCCGCCGAGGACCGCGCGTGTGCGGACCTTGTACGCGTGTCTGGGCGAGAGCGAGGGCGAGCTGTCCTTTGAACCGAACCAGATCATAACGAACGTGTCTCCGTCCGCCGAGCCCGGCTGGCTGAGGGGCTCGCTCAACGGGAAGAGTGGCCTCGTGCCGCAGAACTATGTGGAGCCCCTGCCTTAG

Protein sequence:

>DPOGS204642-PA
MGVGLQPLEFTECLADSPHFRENLQRHEKELERTSQQIKRLIKEVKDVVQAAKRLGAAQLALAASMEQFEFACIGASMTEDERVIGRSLHHFANLIRTIEDERDRMLGRAHEQIIQPLEKFRKEHIGAVKEGKKKFDKKTAKFCQSQERTLSLSTKKPEAVFQEADAAMDMAERDFCQASLEYVFQLQAVQERKKFELVETLLGFVFGWWTFHHTAHDVHADAEPLVRDLQLRIQRTRSNFEETSKQTESLMKKMMEVRQMAFGTTWSKQYCTYEKMTSTLTLMPYNQINVKTAGPVESVVVVGARPVTDAERRFCWEALVEEKPPLALQAAADRERAAWIRTLRRAGAPHTDTPAPRASDGELWPLDDAGFEFVRRLATELEARGLDDQGLYRVAGVSSKVSRLVSLGRSGRLPPSLESFESRTLTSALKSYLRALPDPLLTRRLHDDFLAAAKCERSSERVSRLYSLVRALPPANRAMLQLVLAHLERVAARSDVNLMTSSNLAVCFGPTLLRAERETVASILELKFYNVLVEALLDNISAVFAPLPPAAVPPAENHNGIAGTSPSSIPLASRNDISVCDRSLVTCGGSSVSDVGVSGAAVGNYSPHHHQLLQHFSNAHTRVGRCSSSSESVSSHSASPPPRAPLTHSIHNPSLAFPPRTARVRTLYACLGESEGELSFEPNQIITNVSPSAEPGWLRGSLNGKSGLVPQNYVEPLP-