Monarch geneset OGS2.0

DPOGS208917
Transcript: DPOGS208917-TA (2766 bp)
Protein: DPOGS208917-PA (921 aa)
Genomic position: DPSCF300009 - 373428-393661
RNAseq coverage: 309x (Rank: top 37%)
Annotation
Heliconius: HMEL011553, E-value 0.0, identity 73.06%
Bombyx: BGIBMGA002491-TA, E-value 0.0, identity 75.72%
Drosophila: CG31158-PD, E-value 0.0, identity 58.65%
EBI UniRef50: UniRef50_D6X4C1, E-value 0.0, identity 51.07%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X4C1_TRICA
NCBI RefSeq: XP_969387.1, E-value 0.0, identity 51.07%, PREDICTED: similar to arf6 guanine nucleotide exchange factor [Tribolium castaneum]
NCBI nr blastp: gi|350405381, E-value 0.0, identity 51.49%, PREDICTED: hypothetical protein LOC100742088 [Bombus impatiens]
NCBI nr blastx: gi|350405381, E-value 0.0, identity 52.29%, PREDICTED: hypothetical protein LOC100742088 [Bombus impatiens]
Group
Gene Ontology:
  GO:0032012, E-value 1.2e-46, regulation of ARF protein signal transduction
  GO:0005622, E-value 1.2e-46, intracellular
  GO:0005086, E-value 1.2e-46, ARF guanyl-nucleotide exchange factor activity
  GO:0005515, E-value 1.1e-22, protein binding
KEGG pathway: tca:657863, E-value 0.0
  K12494 (PSD) maps-> Endocytosis
InterPro domain:
  [494-678] IPR000904, E-value 1.2e-46, SEC7-like
  [569-670] IPR023394, E-value 2.1e-32, SEC7-like, alpha orthogonal bundle
  [700-815] IPR011993, E-value 1.1e-22, Pleckstrin homology-type
  [5-91] IPR001478, E-value 1.9e-18, PDZ/DHR/GLGF
  [700-814] IPR001849, E-value 9.2e-13, Pleckstrin homology domain
  [702-721] IPR001605, E-value 5.4e-09, Spectrin/pleckstrin-like
Orthology group: MCL14755, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208917-TA
ATGGCTGACGAAAGGCTAGTGGTCCTGAATCGTTGTGACAACTTGGGTTTTGGATTTTCTTTACTCGGCGAAGCCGGTTTGCCTCATATTATTTACGAAATCGAAGAAAATTCTCCAGCTGCTAAGAGTGGTGAGGTTGAGGTTGGTGATGTATTGCTCAAGGTCAATGGAACTGATGTCAACAGGTTCAGCACACGCGAAGTCTTAAAATGCTTACGACTGTCGGCGGATCCTGTGACTCTTCGTCTCAAAAAAGATCCTCAGATCAAAGCGAACGTGCGACGCTATCTCTCATCCGGCGAGAGACGTTCTAGCGGGCCGCGTGTCAAACAAGATAAATGTGGATCGCCTCCGTCTAGTAATTCGAACAGCTCGTCTAGTTCGTCGAACGGTCTAGCTCGGTCTGGCGAGAGCTGCGAGGCGCTGATCGAGGACCGGTCTGACAGACCGAGACTCACCCAGCCGAAGTTCGAGGCCTACATGATGACCGGTGACCTGATGCTGAACCTCTCCAGGGTTGAACACCCACATCATAATCACCACGCGCCCACACACCGCACACACTATCATAGATATAATTCAACCCCAGCCTCCCCTAGTGAAAACCGTCTAGCTGCTCGTGTGGAACTAACACAGAGGCACAACTCATCACCTAACACCGGCCTATCGGATCATGCGAGTAAAATGTTCAACTCCCAGCCGGCATCTCCAGCTGGTGGTAATACCTCATCAGCTGAGTCCGCTACAAGGACCCAGCACATCGTCAGAACATCCAGATCCGAAGATCACTTACAGAAGGAATCGTCTTTGAGCGCGGTAGCTGTTGATATGGAAGAGGATGTGACGTCATCGCTCAATACACTATTGGACGCTCGGCCGGACTCCGCCACGCCAGGACCTCGCTCGGATTCCGACGAAAGAGACAGGATTGTATGGACGTACAATGCCCCGGTGTCGCAATGTAACGGGTCGGCCGCGACATCAAATTCTACCTCCATATCAGATGGGATGTCACAACGGTCTTCATCCCCGTTGTCGCCAACATCAGCGTCGTGGTCGGCGCTGTCACCACCTCACCGCGCACCCCCCCTACCACGAGCACCACTCAATGGCGACATGAGTTTATCGGAGGCTGTCTCAAACATATCCAGCCCCGATTTCCAAGACCAGGACGACATGTTCGAAACGGGTCGGGAGTGTCCGAGAATGGAACTGTCCGACCCGTCGGACTCGGACTCCACGATACTAGTGTCTGAGCCGTGTCACAAGAGAGCCAAGTCGAACTCCACGTACTCCACGGAACACGGGAGTGACGTCACCCTCAACGGCGACCACAGCAAGGAGTACAGAATAGTCATACAGGTCAAAGGTCCGGACAAACAGAACAACTCCAACGACAACGTCAACAATAATAATAACACGAATTACGCACAAAATGGCAAAGAGAACGGGCACAGCTCGCCCGAAAATCAAGGTTATCAGGAGCTGTGCAGTGGTTCGGACGCTTGTTCTGATGACGGGTCAGACGGTGATTCGCTTCACTCATTCCACTACAGTCCGAAGGCAGTGGACATACCTTCAGCTGAGAGGCTTGCGAAACGACTATACAATCTAGACGGTTTCAAGAAATCCGATGTTTCTAGACATTTAAGTAAAAACAATGATTTCTCCCGCGCCGTGGCGGAGGAATACGTGAAACATTTCGAGTTCGCCAACACTACTTTAGACGAAGCGCTACGAGCGTTCCTCGCGCGGTTCGCTCTCAGTGGAGAAACTCAAGAAAGAGAACGAGTCTTAGTTCATTTCTCACGACGGTATTTAGAGTGTAACCCGGGAGCGTTCAATTCACAAGATGCCGTTCACACGCTCACCTGCGCGATAATGTTACTTAACACAGATCTCCACGGCTGCGGAGGGACGTTCAGGCGCATGTCGTGCGCCGAGTTCATTGATAACCTGGCTGATCTTAACGACGGCGAAAACTTCCCTAGAGAAACATT
AAAACACTTGGACTCTGAAGCAACAAACGAGATTCGGACCGCCCCCGCCGTTGGCAACAATCCGTTCCTCGACTTGCCGGACCAGAGCCGCGCGGTCGAGTACAAGAAGGGTTATGTCATGAGGAAATGCTGTTACGACGCTAACGGAAAGAAAACTCCATTCGGCAGACGTGGTTGGAAGATGTTCTACTGTACGCTGCGTGATCTAGTCCTTTATCTGCACAAAGACGAACACGGCTTCCGACGGAGTCAGATGTCAGATAACCTGCACAACGCTATAAGAATACATCACGCCCTGGCGACTAAAGCCACAGATTATACAAAAAAACAACACGTGTTTAGACTGCAAACTGCTGACCAGGCCGAATACTTGTTCCAGACGAGTGATTCAAAGGAGTTGTGCTCATGGGTGGAGACGATCAACTTTGTATGCGCGTCGTACTCAGCCGCGCCTCTGGCTGGCGCTGTCGGCTCGCAAAGGAAATTCCAAAGACCACTGCTGCCTTGTACTCACACCAAACTTTCCATGCGAGAACAGCTAGCGGAGCATGAGGAGCGCGCTGCCCGTTTAGAGGAGGAATTGGCAGCGTTGAGACTAGCCAGAGATCCACACAGCAGGGACAAAGATCATTACCTCGTGCACGAGATAAAGAGGTATCGAACATACGCGTATGTGATGCGTACTCGTGGTGGCGGCATCGGCGCCGAGGAGAACGCGCCCGCGCTGCCCGAGCGTCCTCACAACCCTCACCACGCGCCGCCCTGA

Protein sequence:

>DPOGS208917-PA
MADERLVVLNRCDNLGFGFSLLGEAGLPHIIYEIEENSPAAKSGEVEVGDVLLKVNGTDVNRFSTREVLKCLRLSADPVTLRLKKDPQIKANVRRYLSSGERRSSGPRVKQDKCGSPPSSNSNSSSSSSNGLARSGESCEALIEDRSDRPRLTQPKFEAYMMTGDLMLNLSRVEHPHHNHHAPTHRTHYHRYNSTPASPSENRLAARVELTQRHNSSPNTGLSDHASKMFNSQPASPAGGNTSSAESATRTQHIVRTSRSEDHLQKESSLSAVAVDMEEDVTSSLNTLLDARPDSATPGPRSDSDERDRIVWTYNAPVSQCNGSAATSNSTSISDGMSQRSSSPLSPTSASWSALSPPHRAPPLPRAPLNGDMSLSEAVSNISSPDFQDQDDMFETGRECPRMELSDPSDSDSTILVSEPCHKRAKSNSTYSTEHGSDVTLNGDHSKEYRIVIQVKGPDKQNNSNDNVNNNNNTNYAQNGKENGHSSPENQGYQELCSGSDACSDDGSDGDSLHSFHYSPKAVDIPSAERLAKRLYNLDGFKKSDVSRHLSKNNDFSRAVAEEYVKHFEFANTTLDEALRAFLARFALSGETQERERVLVHFSRRYLECNPGAFNSQDAVHTLTCAIMLLNTDLHGCGGTFRRMSCAEFIDNLADLNDGENFPRETLKHLDSEATNEIRTAPAVGNNPFLDLPDQSRAVEYKKGYVMRKCCYDANGKKTPFGRRGWKMFYCTLRDLVLYLHKDEHGFRRSQMSDNLHNAIRIHHALATKATDYTKKQHVFRLQTADQAEYLFQTSDSKELCSWVETINFVCASYSAAPLAGAVGSQRKFQRPLLPCTHTKLSMREQLAEHEERAARLEEELAALRLARDPHSRDKDHYLVHEIKRYRTYAYVMRTRGGGIGAEENAPALPERPHNPHHAPP-
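
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is an illustration only, not part of the database: the function names are hypothetical, and it assumes that the transcript is a pure coding sequence ending in one stop codon (shown as the trailing "-" in the protein sequence) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_transcript_bp(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Coding-sequence length implied by a protein length.

    Assumes three nucleotides per residue, plus one stop codon
    when has_stop_codon is True.
    """
    codons = protein_aa + (1 if has_stop_codon else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
transcript_bp = expected_transcript_bp(921)
genomic_bp = span_length(373428, 393661)

print(transcript_bp)  # length implied by the 921 aa protein
print(genomic_bp)     # span of the gene on scaffold DPSCF300009
```

Under these assumptions the 921 aa protein implies a 2766 bp transcript, matching the reported length. The genomic span is much longer than the transcript, which is consistent with the gene containing introns.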