Monarch geneset OGS2.0

DPOGS203926
TranscriptDPOGS203926-TA1044 bp
ProteinDPOGS203926-PA347 aa
Genomic positionDPSCF300005 - 517655-523578
RNAseq coverage421x (Rank: top 29%)
Annotation
HeliconiusHMEL0135057e-15376.15% 
BombyxBGIBMGA002030-TA2e-5781.67% 
DrosophilaGap69C-PA6e-6345.25% 
EBI UniRef50UniRef50_Q8N6T31e-6853.44%ADP-ribosylation factor GTPase-activating protein 1 n=108 Tax=Euteleostomi RepID=ARFG1_HUMAN
NCBI RefSeqXP_394952.34e-6649.33%PREDICTED: similar to GTPase-activating protein 69C CG4237-PA [Apis mellifera]
NCBI nr blastpgi|2977075532e-6853.82%PREDICTED: ADP-ribosylation factor GTPase-activating protein 1-like, partial [Pongo abelii]
NCBI nr blastxgi|3227976006e-7243.31%hypothetical protein SINV_14456 [Solenopsis invicta]
Group
Gene OntologyGO:00323126.6e-52regulation of ARF GTPase activity
GO:00080606.6e-52ARF GTPase activator activity
GO:00082706.6e-52zinc ion binding
KEGG pathwayhsa:557385e-69 
 K12492 (ARFGAP1)maps-> Endocytosis
InterPro domain[7-124] IPR0011646.6e-52Arf GTPase activating protein
Orthology groupMCL12582 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203926-TA
ATGGCATCACCGCGTACCCGACGTAAATTAAACTTTGTCCGCACTCAAGAAGAGAACCACAAATGTTTTGAATGCGGTACTCTTAATCCGCAATGGGTCTCAGTCACATATGGAATTTGGATTTGTCTGGAGTGTTCAGGTGTTCATAGAAGCCTTGGGGTTCATCTTTCTTTTGTTAGATCAGTTACAATGGACAAATGGAAGGATATTGAGCTGGAGAAAATGATGGTGGGGGGAAACCTTAAAGCTCGTACATTTTTTGAAAGTCAACCTGATTATAAACCAGATATGAAAATACAGCAAAAATATAACACTAAGGCTGCAGCCATGTACAGGCAAAAGATAGCAGCTCTCGCTGAGGGTCGTGATTGGAGTCCATCAGATTACAAACCGGATGTAGTCGAGAGACCGTCGGATTGGAGCCAATCACAGTCGTTCTACTCCTCTGGTGATAACACATTCCACACCAGCGGTTCAGATAACAACATCAGTTACCATAGCGAGTACGGTAGCGGACGCTATACAGGATTCGGGAATACTCCGAAGCAGTCCCAGTCAACTCCTATGTCTCCCATCCACAGCGGGCATGAGATCGTTGACAATACACTATCTTCATTAGCCTCTGGCTGGTCAATATTCGCGTCATCCCTTTCTAAGGCTGCTCGTACGGCCACCGAGAGTGCGGTCAAGTATGGAGGGATCGCCTCGCAGAAGGTCTCCGAAATGGCTTCCACCGTCACTGAGAAGGTTAACAATCGCGGCGGCTGGAGTAGTCTCAGTGGTGCCAATGAGATGCGTCGTTCAAGCAACTCTAGTGCCCACTTCCAGTCACCTAGCTATGGAGCCATGAAGAACTCAACATCTGAGCCGTACTCTCAATGGAATGAGCAGGTCCAGCCAGGCATTCAGTCGGTGGCTGCTCGTAATAACTTGGACTCCCGAGATCTCTCCCAGGTCGGAGTATCAAGCCCTCCTCCAGACATAAAGAATATCAACATCAAGAAACGGTCAAGCGACGACTCCTGGGACTGGCTGAACAACTAG

Protein sequence:

>DPOGS203926-PA
MASPRTRRKLNFVRTQEENHKCFECGTLNPQWVSVTYGIWICLECSGVHRSLGVHLSFVRSVTMDKWKDIELEKMMVGGNLKARTFFESQPDYKPDMKIQQKYNTKAAAMYRQKIAALAEGRDWSPSDYKPDVVERPSDWSQSQSFYSSGDNTFHTSGSDNNISYHSEYGSGRYTGFGNTPKQSQSTPMSPIHSGHEIVDNTLSSLASGWSIFASSLSKAARTATESAVKYGGIASQKVSEMASTVTEKVNNRGGWSSLSGANEMRRSSNSSAHFQSPSYGAMKNSTSEPYSQWNEQVQPGIQSVAARNNLDSRDLSQVGVSSPPPDIKNINIKKRSSDDSWDWLNN-