Monarch geneset OGS2.0

DPOGS201484
TranscriptDPOGS201484-TA3120 bp
ProteinDPOGS201484-PA1039 aa
Genomic positionDPSCF300006 + 112782-143650
RNAseq coverage486x (Rank: top 26%)
Annotation
HeliconiusHMEL0159400.078.46% 
BombyxBGIBMGA002673-TA3e-15076.52% 
DrosophilaCG30372-PB0.060.02% 
EBI UniRef50UniRef50_E0VXN30.062.18%130 kDa phosphatidylinositol 4,5-biphosphate-dependent ARF1 GTPase-activating protein, putative n=2 Tax=Neoptera RepID=E0VXN3_PEDHC
NCBI RefSeqXP_397124.30.065.13%PREDICTED: similar to CG30372-PB [Apis mellifera]
NCBI nr blastpgi|3504081480.060.48%PREDICTED: arf-GAP with SH3 domain, ANK repeat and PH domain-containing protein 1-like isoform 1 [Bombus impatiens]
NCBI nr blastxgi|3504081480.060.86%PREDICTED: arf-GAP with SH3 domain, ANK repeat and PH domain-containing protein 1-like isoform 1 [Bombus impatiens]
Group
Gene OntologyGO:00323123.1e-38regulation of ARF GTPase activity
GO:00080603.1e-38ARF GTPase activator activity
GO:00082703.1e-38zinc ion binding
GO:00055151.2e-20protein binding
KEGG pathwayame:4136830.0 
 K12488 (ASAP)maps-> Endocytosis
    Fc gamma R-mediated phagocytosis
InterPro domain[581-734] IPR0206832.2e-48Ankyrin repeat-containing domain
[460-580] IPR0011643.1e-38Arf GTPase activating protein
[979-1038] IPR0014521.2e-20Src homology-3 domain
[343-438] IPR0018494e-14Pleckstrin homology domain
[338-437] IPR0119931.5e-13Pleckstrin homology-type
Orthology groupMCL10457 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201484-TA
ATGCCGGGTCTTATTGGTATTGGTGAATTTATTGACGAAACTCGAGAGGATTACAGTTCGCCAACTACTTCGACATTCGTATCGCGTATGCCACAATGCAAACAGACCATTAACGCATTGGAGGAGTGGCTCAAACACGGGGGGAATAGATTAATGGCAAGGGAGACAGATGTTCGTACCTCCTTTGAATGGCTCGTGGAAGCCATTGTGGACTGGTCGCAGTTCAAAAAGAATGGTCTTGTTACCCTGGATTTTGATCGCGATGGACTAACAAAGCTCAAAAAAGCAATAAAAGCCATACACAACTCAGGGAATGCCCACGTGGATAATGAGATGTACCTGTCCCGGGCGCTGGAGCGTCTCGGAGGAAACGCGCTCAGCAAAGATTCGGAACCCGACATAGGAGCCGCTTTTTTCAAATTTGCGGTCGTCACCAAAGAACTGTCAGCACTCATGAAGACACTGATGCAAAATATAAACAATATAGTGATGTTCCCTGTGGATAGTCTCTTAAAGGGTGATCTCAGAGGTGTGAAGGGGGACCTGAAGAGACCGTTCGATAAGGCATCCAAAGACTACGAGTCAAAGTACCTGAAGATTGAGAAGGAAAAGAAGTCACAAGCGAAGGAAGCCGGCCTCATAAGGAGTGAAGTCACATCGGCTGAGATAGCTGATGAGATGGAGAAAGAGAGAAGATTGTTTCAGCTGCAAATGTGTGAGTACCTGATAAAGTTTAATGAAATAAAAACCAAGAAAGGCATAGAACTGCTTCAGCACCTGGTGGAGTACTATCACGCACAAACTAATTATTTCCAAGATGGTTTAAAGACTATCGAACACTTCGGTGTGTACGTAGCGGATCTTAGTGTTCAATTGCAGAAGATAAGACAGCAGCAAGACGACGAGCGAAGAAGATTATTGGAGTTGAGGAACATGTTGAGAGCAGCTACACCGCAAGATAGAGAAGCGGCGGCTGCAGTGGGCGGATATTCACTTCACCAATTACAAGGTGACAAACAGCATGGAGTCACACGGAGCGGTTATCTTCTTAAAAAGTCGGAAGGTAAAGTTCGAAGGGTCTGGCAAAAGAGAAGATGCAGAGTCACGGCTGAGGGTTTCTTGGACATATTCCACGCGGACGAGAATAAAACTCCGGCCAGGGTCAATCTGCTTACGTGCCAGATAAAAGTGGCCTCGGAGGATAAAAGGGCGTTTGATCTAGTCTCATATAATCGTACGTACCATTTCCAAGCTGAAGATGAGAACGAACAGCGTACTTGGACCTCGGTCCTCGTGAATTGTAAGGAGGGAGCTCTGATGAGGGCCTTCCACCAACAGGCTGGTGAGAGTAATGACAATGGACACTCACTACTCGATCTCCAGAGATCAATCATCAGAGCAGTGAGAGCTATGCCAGGGAATCAAGTCTGTGCTGACTGCGGTTCAACTAATGATCCCACGTGGTTGTCAACGAACTTCGGCGTGATAGTGTGTATAGAATGTTCTGGAAGCCATCGCGAACTCGGAGTACATATATCAAGGATACAATCTCTGACACTGGACAGACTGAGCACCTCACAGTTGCTGATAGCAAGGAACATGGGCAACCAGACGTTCAATGAAGTCATGGAAAACACATTGGATGAAAGAGATAAACTCACACCAGAGAGCACCATGGAGGAGAGACTGAGGTTTATACGGGAAAAGTACGTGTATAGAGCGTGGGCGGGCAGGACTTGTAGAGATGAAACAGAGAGGCTGTCGGAAGTGGAACACGCCGTTAACAACGGACACTTACAGAACTTACTCCAGGCTTATGCTGAAGGGGCTGACTTGAGTTCAGCTCTGCCCGGTTCTGATTGCGGTGAAACAGCGCTCCATCTGGCGATATCTAGAGAAATCGGCGACGGGTCCGGTCTGCACATAGTGGATTTTCTGATCCAGAACGGCGGTTCACTGCTGGATAAGACCACCTTATCTGGGATGACTGCCCTACACCTGTGTGCAGCAACGGACAGGGCGGAGGCCATGAAATTGCTGCTGAAAGCCGGTGCTGATACCACGCCGAAAGATATCAATGGAAGAACAGCTCTGTACATCGCGAGACAACTTGGACACCACGCTTGTGTGGAATTGTTGGAAAGCGTTGATAAAAGGGAAAAGAGTATATTCGAAAATATAAATATTGATTGGAATTTGTCACACGACGATGGCTCCACAGACTTCTCTGACGATGACACTGTCATCGATGAAAGGCAAAACGGCAGCGTTACTCCAGAGAAAAAGTGTCCGCGTTCCCGCCCGCCGTCATACGCTGGTACGTGTGGGGGCGACTCCCCTGTACTACGATCAAGATCCTCAACCTGCGACTCCCTCCACCACGCACCGCCAACCCCTACCACGCTACCGCGAAAACCCAACTATAATATAGGCAGTTTAAAAAAGCGTGTCGCCCCCCTTCCCCCCGCGGTGTCCGTGTCTCACCCCCCCTCGCTGGCTGCTCTGCCATCACACGCGAGATCCACACCATCGCCATCCGCTGACAACACCAGGTCCATACATGGGACCGGTGCAGCGCCGGCAGTGAAGCCGCGACCTCAACAGCCGCCGCCCACGCCGCCGCCCGTGTCACTGCACAACGGAGCCCATAGGGACGAACCGACACCTCCGCCCAGGAAGAAGAAAAATAGATTGCACCTGGAAAATGCTGTGAAAGGAAAACTTAAAAGGGAATGTTCTAGTCAAGATTCATCGTTGGAGTTATGTGATATATCTGATGGCTTGGACGACAGTAGGCTGCAGTCATGTCTGTCGGACAGTTCTAGATCTCGTGAGAGAACGAGACGCAGCGACAGGTCCCTAGATGTGTCAGACACTTCCAGTCTTCACTCGAGGTCACCTTCCACTTCTATAACTATGATGGGTGGAGGTATGAGGAGGTGCCGAGCGCTATACGACTGTTCTGCTGATAATGAGGACGAGCTGTCGTTCAGGGAGGGCGAGGTCATAGTCGTCATCAACGAACGAACCGAAGACGACAACTGGATGGAGGGACAGGTCGAGGGGACCAGCCGCCGCGGCATGTTCCCCGTCTCTTTCGTACACATGCTGCCTGATTAG

Protein sequence:

>DPOGS201484-PA
MPGLIGIGEFIDETREDYSSPTTSTFVSRMPQCKQTINALEEWLKHGGNRLMARETDVRTSFEWLVEAIVDWSQFKKNGLVTLDFDRDGLTKLKKAIKAIHNSGNAHVDNEMYLSRALERLGGNALSKDSEPDIGAAFFKFAVVTKELSALMKTLMQNINNIVMFPVDSLLKGDLRGVKGDLKRPFDKASKDYESKYLKIEKEKKSQAKEAGLIRSEVTSAEIADEMEKERRLFQLQMCEYLIKFNEIKTKKGIELLQHLVEYYHAQTNYFQDGLKTIEHFGVYVADLSVQLQKIRQQQDDERRRLLELRNMLRAATPQDREAAAAVGGYSLHQLQGDKQHGVTRSGYLLKKSEGKVRRVWQKRRCRVTAEGFLDIFHADENKTPARVNLLTCQIKVASEDKRAFDLVSYNRTYHFQAEDENEQRTWTSVLVNCKEGALMRAFHQQAGESNDNGHSLLDLQRSIIRAVRAMPGNQVCADCGSTNDPTWLSTNFGVIVCIECSGSHRELGVHISRIQSLTLDRLSTSQLLIARNMGNQTFNEVMENTLDERDKLTPESTMEERLRFIREKYVYRAWAGRTCRDETERLSEVEHAVNNGHLQNLLQAYAEGADLSSALPGSDCGETALHLAISREIGDGSGLHIVDFLIQNGGSLLDKTTLSGMTALHLCAATDRAEAMKLLLKAGADTTPKDINGRTALYIARQLGHHACVELLESVDKREKSIFENINIDWNLSHDDGSTDFSDDDTVIDERQNGSVTPEKKCPRSRPPSYAGTCGGDSPVLRSRSSTCDSLHHAPPTPTTLPRKPNYNIGSLKKRVAPLPPAVSVSHPPSLAALPSHARSTPSPSADNTRSIHGTGAAPAVKPRPQQPPPTPPPVSLHNGAHRDEPTPPPRKKKNRLHLENAVKGKLKRECSSQDSSLELCDISDGLDDSRLQSCLSDSSRSRERTRRSDRSLDVSDTSSLHSRSPSTSITMMGGGMRRCRALYDCSADNEDELSFREGEVIVVINERTEDDNWMEGQVEGTSRRGMFPVSFVHMLPD-