Monarch geneset OGS2.0

DPOGS212033
TranscriptDPOGS212033-TA3027 bp
ProteinDPOGS212033-PA1008 aa
Genomic positionDPSCF300054 - 118235-135565
RNAseq coverage166x (Rank: top 51%)
Annotation
HeliconiusHMEL0084389e-17685.02% 
BombyxBGIBMGA010002-TA0.090.25% 
Drosophilacv-c-PD0.040.97% 
EBI UniRef50UniRef50_UPI00022470EF0.048.68%UPI00022470EF related cluster n=1 Tax=unknown RepID=UPI00022470EF
NCBI RefSeqXP_001606903.10.049.40%PREDICTED: similar to CG31319-PA [Nasonia vitripennis]
NCBI nr blastpgi|3504224510.050.89%PREDICTED: hypothetical protein LOC100745795 [Bombus impatiens]
NCBI nr blastxgi|3504224510.050.66%PREDICTED: hypothetical protein LOC100745795 [Bombus impatiens]
Group
Gene OntologyGO:00071659.8e-50signal transduction
GO:00056229.8e-50intracellular
KEGG pathwaytad:TRIADDRAFT_599781e-16 
 K12490 (ARAP)maps-> Endocytosis
InterPro domain[525-767] IPR0001989.8e-50Rho GTPase-activating protein domain
[537-767] IPR0089367.3e-42Rho GTPase activation protein
[804-1000] IPR0029135.9e-24Lipid-binding START
[800-980] IPR0233932.7e-18START-like domain
Orthology groupMCL10586 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212033-TA
ATGCAGAATGGGACCGTTTTTGAAGAAGATCTGCAATTTCCAATCGACGTGTCCGGAGTTCAAAACGACCACCCATTTTTGGACGCGGACTCGCTCCAGTCATTGTTCAGACGACTCACCGCCCTCAACAGATGTGCCACCATGAAGTTAGAGCATCATCATCATCAGAAGAGATCGAACGACTCTGACGACGAACAATGCGCATTAAGCGATAAATGGCAATACGAGAGAAAATCTCGAAGATGGTCCCGGGTCGTGGAAATTACGCCTCAAGCACAAAGGAGATTACAGGTGATAGCAGCCAGAGCGCTAGCAGAACGAGAGGCAAGAGAGGCAAGAGGTGAAGTGGAAGATGATGAGGATGAAACTCTACCAACCATCAGGGTTGGTCTGGTTGACGCGGACGGACATTACACAGACGTAGAACCAGATCTTCTGACAGTACCCGGACACCATATGCAGTTACCAAGTGATAAAGAAGATGAAGAGGACACTGGAACCAGGTTCAGGCGAACTGGGTCCGAAAGATTACGGGACGGGGCGAAGGCGTTATTAAGGAGGGTCGAGTCGCTGAAAACGAGAAGAAGGAAACGGCAAAACAGAACGGAAGTTATCGTTTCATCGCCACACTTTCTGGACGTCAATCAGGACAAATATGCTGATTTGAATTACATAGATATGACGCCCACTACTCCAACAGCCTTTCCCTTCCCGGACTTCCACAGTTCGCCAGTCCATGCACCAGCTATACGTACGCAACCCCCTTCCCCCATGACGATAATGCCACCATCTCCCATCGCTCCCCTTGAACAATCCTTCGTGTTACACGCCCCTTTTGGTGACGACAGTTCCAGTTATGCCTCTGATAACAGTATGGGATCAAGCAATAAAAGTTCTAAGTCAAAACTGGGAAGGGCCAAACGAATCTTTCATAGAGGACTTAAAAGTGACGATTCGAGCGCCTTGAGCGATTCCGAATGCCAGCCGGCTAGTTGGAGGCACAATTACTATAAAGAGCATAACACTCATCAAAATACTGAGGTTTATGTTGAACCTCCATCACCAGTTGAACTGAAGCCAGATCCAGAAGTACTTCGAGAGGTCCAAAAATCACCAGCACATACAGCTCATAGACGCCAGGCAATACGGACGAGTTCATTGAACTTGGGAAAGGACACACCAAAATTTCGTGATAGAAGTCTCAAAAGGGAAAAATCAGCATCGAGAAGCTCCGAATTGGACAGTAGTCCGAACTCAGCGGGGTCAAGGTCGCATGATGGTGACCACGATGACGATGATGACAGCCTGGATATAAAGACAAAGAAAAGCAATATACAAAGATGGTACTCGTTCAGGACGAACGTACTTAACGGCGGCCCCAAAGCAAATAATACTCTGCAGCCTGCCCCCAACCAAAAGAATCCTGAAACTTACTTGTCACGTCCGATGTCGTCGCTGTCGTGTGGTCAGCTGCATATACTGAGGAAGCTAGCGTTACTGCGGCTGACAGCTTGTATGGAAAGATATTGTCCGTCGCATAAATCCGGTTGGAATTGGGAGCTACCTAAACTTATAAGAAAGATCAAGACGCCAGATTACAAAGATAAGACAGTTTTCGGCGTACCTCTGACAGTATCTCTACAGAGGACGGGTCACTCTTTGCCAAAACCGATTCAATGCGCTCTCAATTGGCTGAAAAACAATGCATTAGAACAGATGGGTATATTCAGAAAGGCGGGAGTAAAATCTCGAATAGCCAAACTGAGAGCCATGGTGGAGTCAGCGGGAGCGACTACAGCAACATTCGCCATCGAGAACATGAACACCATAGACCCAAACCAGTCTAGTTTGAACTTCGACGGCGCCCAGGCTCATGACGTAGCGGATATGGTGAAGCAGTACTTCAGAGAGCTCCCTGATGCTTTGCTGACTAACAAATTGAGTGAAACCTTCATCGCTATATTCCAACATGTTCCGGAGCCGCTCAGGCCAGATGCCGTTCAATGTGCATTACTGCTTCTACCCGAGGAACATCTGGAGGCCTTGCAGTCTCTACTCATTTTCTTAGCGGAGGTCGCGGAACACTCAACCACGAATCAAATGACGGCATCAAATTTGGCCGTTTGTTTTGCGCCGACGCTGTTAAGACTACACCACGCGCCTCCGCAGACTGGTAGCACCAAAGAAGGAAATTCAAATAGAACCCTGATAGAGAGCGCCCTTGACCAGCGCCAGATCAGCGAGTCCCGAGCAGCACACGCCTGCCTGCTGCTGCTGGTGACGCAGCACAAACAGCTGTTCCTCGCACCAGCTGATATGCTGGCCAGGTGCAAGTTCAATTACTTCGAGGAGAGCGTACCGGTATCGCTTGAAGAGTTGGGAGCTGATTTCAATCAAGACTGGAAAGGTTTTCAGCAAGCCTGCATCAAAGCTTTACTCAAAGAAGCCAAAGAGAAAACGCGCGGCTGGATGTCAGTGTCGGGAGCGGCGCCTCACGTTGAGCTGTGCTGCAAGAAAGTGGGAGACGGACATCCGTTGAGGTTGTGGAGGGCGACCACGGACGTGGAAGCTCCGCCGCAGGAGGTGATGCAAAGGATACTCCGCGAGAGGCATATTTGGGACGATTCCCTTGTAAAGTGGCGCGTGGCTGAAAAGTTGGGACCTAACGCAGAAGTCTTTCAATACATCACAGCATCCAGCTTCAACTTGCCTCATAGGGACTATTGTGTGCTACGATCTTGGCAGCAGGGTGGCAGCGGCGGCGGCGGTTGGTGCGCGGTGGCGGAGACGTCGGTAGCTCACGGGGCGGCGTCCGATGGAGCCCGGGGCGTCGTCCTAGCTTCGAGATACCTCGCGCAGCCTGCGGGTAAAGGGCGCAGTAGACTTGTACATCTCGCCAGGGTTGATACTATGGGACGTACTCCGGAATGGTACAACAAATGCTATGGTCATATCTGTGCATTGTACCTGGCCAGGATTCGAGCCTCCTTCAAGCATAACACAGAAGGACCCGAATCGAATGTATGA

Protein sequence:

>DPOGS212033-PA
MQNGTVFEEDLQFPIDVSGVQNDHPFLDADSLQSLFRRLTALNRCATMKLEHHHHQKRSNDSDDEQCALSDKWQYERKSRRWSRVVEITPQAQRRLQVIAARALAEREAREARGEVEDDEDETLPTIRVGLVDADGHYTDVEPDLLTVPGHHMQLPSDKEDEEDTGTRFRRTGSERLRDGAKALLRRVESLKTRRRKRQNRTEVIVSSPHFLDVNQDKYADLNYIDMTPTTPTAFPFPDFHSSPVHAPAIRTQPPSPMTIMPPSPIAPLEQSFVLHAPFGDDSSSYASDNSMGSSNKSSKSKLGRAKRIFHRGLKSDDSSALSDSECQPASWRHNYYKEHNTHQNTEVYVEPPSPVELKPDPEVLREVQKSPAHTAHRRQAIRTSSLNLGKDTPKFRDRSLKREKSASRSSELDSSPNSAGSRSHDGDHDDDDDSLDIKTKKSNIQRWYSFRTNVLNGGPKANNTLQPAPNQKNPETYLSRPMSSLSCGQLHILRKLALLRLTACMERYCPSHKSGWNWELPKLIRKIKTPDYKDKTVFGVPLTVSLQRTGHSLPKPIQCALNWLKNNALEQMGIFRKAGVKSRIAKLRAMVESAGATTATFAIENMNTIDPNQSSLNFDGAQAHDVADMVKQYFRELPDALLTNKLSETFIAIFQHVPEPLRPDAVQCALLLLPEEHLEALQSLLIFLAEVAEHSTTNQMTASNLAVCFAPTLLRLHHAPPQTGSTKEGNSNRTLIESALDQRQISESRAAHACLLLLVTQHKQLFLAPADMLARCKFNYFEESVPVSLEELGADFNQDWKGFQQACIKALLKEAKEKTRGWMSVSGAAPHVELCCKKVGDGHPLRLWRATTDVEAPPQEVMQRILRERHIWDDSLVKWRVAEKLGPNAEVFQYITASSFNLPHRDYCVLRSWQQGGSGGGGWCAVAETSVAHGAASDGARGVVLASRYLAQPAGKGRSRLVHLARVDTMGRTPEWYNKCYGHICALYLARIRASFKHNTEGPESNV-