Monarch geneset OGS2.0

DPOGS216191
TranscriptDPOGS216191-TA2835 bp
ProteinDPOGS216191-PA944 aa
Genomic positionDPSCF300080 - 67175-79988
RNAseq coverage755x (Rank: top 17%)
Annotation
HeliconiusHMEL0163250.091.50% 
BombyxBGIBMGA004600-TA0.073.56% 
Drosophilastep-PD6e-4041.00% 
EBI UniRef50UniRef50_E2BY840.058.84%IQ motif and SEC7 domain-containing protein 1 n=9 Tax=Formicidae RepID=E2BY84_HARSA
NCBI RefSeqXP_396854.30.050.88%PREDICTED: similar to schizo CG32434-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3407175420.051.05%PREDICTED: hypothetical protein LOC100647622, partial [Bombus terrestris]
NCBI nr blastxgi|3800306980.051.76%PREDICTED: uncharacterized protein LOC100863984 isoform 2 [Apis florea]
Group
Gene OntologyGO:00320123.4e-79regulation of ARF protein signal transduction
GO:00056223.4e-79intracellular
GO:00050863.4e-79ARF guanyl-nucleotide exchange factor activity
GO:00055152e-09protein binding
KEGG pathwayame:4134100.0 
 K12495 (IQSEC)maps-> Endocytosis
InterPro domain[444-635] IPR0009043.4e-79SEC7-like
[521-641] IPR0233943.1e-41SEC7-like, alpha orthogonal bundle
[642-776] IPR0119932e-09Pleckstrin homology-type
Orthology groupMCL11350 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216191-TA
ATGGAGGGGAACAGATATGTTACAGCTGTTCACCGACCGCTGCCAGCTCTAAACCACTCGAACTCGTCATGCTCTTCAGCATCGAACGCATCAGGCGCCGGGCTGTGCGCGGGAACCGAACCCACACCGCCGCTGTACATCGCCGCCCCGCCAGAGGGATTCGGGCATGCTGTGCATACAAACAGGCGAGGCACAGACGTGGGAATCATTGGCACAACTGGAATAACACAAAAAGATCCAACATTACCCCAAAAAAGGGGTAAACTGACACGAGGTCGATATGAAATGTCCCAAGATCTACTGGACAAGCAAGTAGAGATATTAGAGCGACGCTACGGTGGACTCAGAGCTAGAAGAGCAGCCGTCACTATTCAAAGAGCGTTTAGAAGAAGACAGTTGTTGAAAAAGTTCAGCGCTATCACCGCTATGGCTAAAGCAGCCGACTCTAACGCCAGGAGAATGCAAGATGGAGAACATTTGCTGGTCAACGGTAACGATATCTACGCGAACGCTCACACACAGAAAAGTCCAGCCTCCGATAAGGTTTTGGCCTTAACGAGACCCGTCAGATCAATGTCGCTGAGAGAGAGACACGACATCGACCTAAACAACCACATGCCGTGTGATCCTCAGAACGAGGTGGAGATGATGATGGCGAGGGTGCAGCAATCAGCTCATCAGATACACATGATAGCATCTGAGATACCAACCATGCATAGAGACGCCGACAGTGGGGTTTGCAGTTGTTCTGTGACCGGTTCCGTCACCAATATATCCACTTCATCATCACACAATGATTCTCTTCATAATCACCAGCTGCTAAATAACACAGGGCCAGTTTACGGTAACTTGTTATCGTCCGGCGGCAAACCACTGTCGTCCCCGACCGCGCGGCAACAGCAACAACTAGCAGCAATACAAGCGGCTAACGCCAGGCGACACCCGCCAGAAGTGCCGAAACGTACCAGCAGCATTACACTCAATGCTGAGAGTCCGGTATCTCTTCGCCGTGCTGAGTGCGGGGGCGCGGTGGTACGCGGCGGGTCGATGTCTTCCGTACAATCGTCGTCGTCGTCTTCGTCGCAGGAACATCGTCACCACTACCACCAGCCTTACCAGCCGAGAGCACCGGAGCCGCACATGCATATTCAGGAGCAGCAGTTTCAGCACGCTCGTACGGCATCCGGGACGCTGTATGAAGGGGAGAGATATGCTGTTTGGAAGAGGCAGGAAGCCCCGCCGTACTATGAGGCGCCCCCACCAGCTACATGCTGCAGGCCACAGCACGACCATCACCAGCACACACACCAACAACCGTTGAAGGTGTCGGAGACAGTACGTAAGCGACAGTACCGAGTGGGTTTGAACTTGTTTAACAAGAAGCCAGAGCGCGGTATCGCCTACCTGATATCTCGCGGGTTTTTGGAGAACTCCCCACGAGGCGTCGCCAGATTTCTCATCACCAGGAAGGGGCTCAGTAAACAGATGATCGGAGAATACCTCGGAAACCTTCAGAGTCAATTTAATATGGCTGTGTTAGAGTGTTTTGTTACCGAATTGGATTTATCTGGTATGGCGGTGGACGTTGCTTTGCGAAGATACCAGGCTCATTTCCGTCTCCCCGGGGAGGCTCAGAAGATCGAACGACTGGTCGAAGCCTTCGCTAGAAGATACTGTGTATGCAACCCAGACTTTGTGCAGCGATTGAGAACTCAGGACACGATTTTCGTGCTAGCTTTCGCCATAATAATGTTGAACACGGATCTGCACACACCAAATCTAAAACCAGAGGCTCGTATGTCGCTGGACGACTTTGTAAGGAACCTCCGCGGCATAGACGACTGCGGAGACATAGACCGAGATATGCTGGCCGGGATATATGATCGAGTGAAGGCCAGCGAATTTAGACCCGGCAGTGATCACGTCACACAGGTAATGAAAGTACAAGCGACCATAGTCGGTAAAAAGCCCCACTTGGCTTTGCCTCACCGCCGTTTGGTCTGCTACTGCCGTCTGTATGAAATACCAGACATACACAAGAAGGAACGACCGGGGGTACATCAGAGAGAGGTGTTCCTGTTCAACGATCTGTTGGTGATAACTAAGATCTTCAGCAAGAAGAAGTCGTCCGTGACCTACACGTTCCGCCAGTCGTTCCCTCTTTGTGGCATGATCGTCAATCTGTTCTCTGTGCCGCACTATCCATTCGGTATCCGTCTATCCCGCCGTGTGGACGGCCGCCGTCTCGCCACCTTCAACGCCAGGAACGAGCATGATAGATGCAAGTTCGCCGAAGATCTGAGGGAGAGTGTGGCTGAGATGGACGAGATGGAGGCGATGAGGATCGAGGGGGAACTGAACAGGGGCAAGGGGAGGGGGACGGGGAACACCGGCAGGAATGTGATGGAGAATCGTGACAGCGGCGTGGCTGATGTGGAAATAGATCACCAACAGTGTATGGAGCACGTTCACGTCCACGGCGTGGTGGTACCCCCGCCCCCGTCGTGTCCAGCGCCCGCCGCCCCCTCCCGAGTTATACATCAATATGTTGATGTTGTAAATCTCCTTAACCACTCACGTGTGACCCCTTGTACAGTGGCGCTGGCGGCTGAGCGTCTTCAGCGTCGCGGGTCTGTGGGGTCTCTGGACAGTGGTATGTCGGTTTCTTTCCAACACGGGAGTAGAACAGCACTCGCCACATCCCCCCGACATCCGCCGGACCAACGATCGCCGGGTCGCCTCTTCGGCGGTATATTCCCCGGTCGTCAACGCAAACTGAGCGCTTCCGGTGTGCCAGACACGAAGGGCCAAATCGCCAAAAGCACTGAGGTGTAA

Protein sequence:

>DPOGS216191-PA
MEGNRYVTAVHRPLPALNHSNSSCSSASNASGAGLCAGTEPTPPLYIAAPPEGFGHAVHTNRRGTDVGIIGTTGITQKDPTLPQKRGKLTRGRYEMSQDLLDKQVEILERRYGGLRARRAAVTIQRAFRRRQLLKKFSAITAMAKAADSNARRMQDGEHLLVNGNDIYANAHTQKSPASDKVLALTRPVRSMSLRERHDIDLNNHMPCDPQNEVEMMMARVQQSAHQIHMIASEIPTMHRDADSGVCSCSVTGSVTNISTSSSHNDSLHNHQLLNNTGPVYGNLLSSGGKPLSSPTARQQQQLAAIQAANARRHPPEVPKRTSSITLNAESPVSLRRAECGGAVVRGGSMSSVQSSSSSSSQEHRHHYHQPYQPRAPEPHMHIQEQQFQHARTASGTLYEGERYAVWKRQEAPPYYEAPPPATCCRPQHDHHQHTHQQPLKVSETVRKRQYRVGLNLFNKKPERGIAYLISRGFLENSPRGVARFLITRKGLSKQMIGEYLGNLQSQFNMAVLECFVTELDLSGMAVDVALRRYQAHFRLPGEAQKIERLVEAFARRYCVCNPDFVQRLRTQDTIFVLAFAIIMLNTDLHTPNLKPEARMSLDDFVRNLRGIDDCGDIDRDMLAGIYDRVKASEFRPGSDHVTQVMKVQATIVGKKPHLALPHRRLVCYCRLYEIPDIHKKERPGVHQREVFLFNDLLVITKIFSKKKSSVTYTFRQSFPLCGMIVNLFSVPHYPFGIRLSRRVDGRRLATFNARNEHDRCKFAEDLRESVAEMDEMEAMRIEGELNRGKGRGTGNTGRNVMENRDSGVADVEIDHQQCMEHVHVHGVVVPPPPSCPAPAAPSRVIHQYVDVVNLLNHSRVTPCTVALAAERLQRRGSVGSLDSGMSVSFQHGSRTALATSPRHPPDQRSPGRLFGGIFPGRQRKLSASGVPDTKGQIAKSTEV-