Monarch geneset OGS2.0

DPOGS203250
TranscriptDPOGS203250-TA1638 bp
ProteinDPOGS203250-PA545 aa
Genomic positionDPSCF300210 + 119329-146954
RNAseq coverage1346x (Rank: top 9%)
Annotation
HeliconiusHMEL0058301e-15499.60% 
BombyxBGIBMGA004647-TA5e-4843.50% 
Drosophilastep-PD0.083.66% 
EBI UniRef50UniRef50_Q17HL65e-18081.41%Cytohesin 1, 2, 3, 4 (Guanine nucleotide-exchange protein) n=5 Tax=Metazoa RepID=Q17HL6_AEDAE
NCBI RefSeqXP_002432546.10.085.11%Cytohesin-1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3071672890.076.94%Cytohesin-1 [Camponotus floridanus]
NCBI nr blastxgi|3071672890.076.94%Cytohesin-1 [Camponotus floridanus]
Group
Gene OntologyGO:00320124.2e-97regulation of ARF protein signal transduction
GO:00056224.2e-97intracellular
GO:00050864.2e-97ARF guanyl-nucleotide exchange factor activity
GO:00055151.5e-40protein binding
KEGG pathwayath:AT3G433004e-43 
 K13462 (MIN7)maps-> Plant-pathogen interaction
InterPro domain[205-390] IPR0009044.2e-97SEC7-like
[281-397] IPR0233942.3e-50SEC7-like, alpha orthogonal bundle
[404-526] IPR0119931.5e-40Pleckstrin homology-type
[407-523] IPR0018491.5e-22Pleckstrin homology domain
Orthology groupMCL10570 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203250-TA
ATGATAAGTGAATTCGAAAATTTGTCTCTCGGAGGAAATAATGCTGCGAGTAATCATGGGGATCCTTTTACACATACGGAGCTGACACCGGAGCAGCAAAAAACCTTAATAGATATACGACGCCGAAAAACTGAACTACTGCTAGAAATACAGGTAGTTGTCACGGTGCCAAAGACGGTGGCGCACGCGGCTCTCTTCAGTCGAGTGCGTTCCTTAGCGGGAAGTGTATGGCGCGCGATGTGTTTACATTTTCGGCTGCTGTGCGGGGCGCTCTGGTGGCCGGTGCTGATGCGCTGCGCTCGCGCAGCACGCTCCTATGACGACGATTACGATCAGGCCGTGCCCGTGGCCGCCGAAGAGGAGCGCCGTGGTTCAGTGTCTAATTGGTTCTCGTCTTTGAGACGGGGCGGTCGCCGTAAACGTGAAGACAGCGCCGTGCCGACTGTTACTGTCGGTTACGGTTCATTAGGCAGGCGGAAGGATGGCGGCCAGGCACGCGGGAAGACGAGATCGGCATGGGATCTCACCACCGTCACCAGGCTGCAACTTAAAGATGAGCTCGGCGAGGTGGTGGCCGAACTGGAAGCCCTCGATGGACAGGAGGAGTGCAAACAGAACAGCAAAGCCAAACAGATGAGCATAGGAAGGAAGAAATTTAATATGGACCCTAAGAAAGGAATCGAATATCTGTACGAGAATGGTTTATTACAAAGGACAGCGGAGGACGTGGCACAGTTCCTTCACAAGGGCGAGGGTTTGAGTAAGACGGCTATAGGGGACTATCTGGGGGAGAGATCAGACTTCAACGAGGCTGTGCTCAGAGCTTTCGTGGAACTTCACGATTTCACGGACCTCATACTAGTTCAGGCTTTGAGACAATTCCTATGGTCTTTCCGTCTACCGGGCGAGGCTCAGAAGATAGACCGTATGATGGAGTCGTTCGCCCAGCGCTACTGTCAGCTCAACCCTGACATATTCACCAACGCCGACACGTGCTACGTGCTCAGCTTCGCCATTATAATGCTGAACACGTCGCTACACAACCCCAGCGTGAAGGATAAACCATCGCCCGAACAGTTCGTCGCCATGAACAGGGGCATCAATAACGGAGGGGATCTACCGCAGGAACTGCTCTTGTCTCTATACGAGTCTATAAAGACGGAGCCGTTCAAGATACCAGAGGACGACGGGAACGATCTGATGCATACCTTCTTCAACCCGGACAAGGAGGGGTGGCTTTGGAAACAGGGCGGAAGGTATAAATCATGGAAGAGGCGATGGTTCATATTGAACGACAACTGCTTGTACTACTTCGAGTACACCACTGACAAGGAGCCGCGGGGGATAATACCGCTGGAGAACATATCAGTCCGTGCAGCGAGCGACCGTCAGCGTCCTCACTGCCTGGAGCTGTACGCGAGTGGTGGCGCGGATCTCATCAAGGCTTGTAAGACTGACTCCGAAGGGAAAGTTGTGGAAGGGAAACATACAGTATACCGCATGTCAGCGGCCACGGCTGAGGAACGCGACGAATGGATAGAATGCCTCAGACGGTCCATCAGTCACAACCCGTTCTACGACATGCTGGCACAGAGGAAGAAAAAGGCACAACACAACCTTCACTCAGGATCACACTAG

Protein sequence:

>DPOGS203250-PA
MISEFENLSLGGNNAASNHGDPFTHTELTPEQQKTLIDIRRRKTELLLEIQVVVTVPKTVAHAALFSRVRSLAGSVWRAMCLHFRLLCGALWWPVLMRCARAARSYDDDYDQAVPVAAEEERRGSVSNWFSSLRRGGRRKREDSAVPTVTVGYGSLGRRKDGGQARGKTRSAWDLTTVTRLQLKDELGEVVAELEALDGQEECKQNSKAKQMSIGRKKFNMDPKKGIEYLYENGLLQRTAEDVAQFLHKGEGLSKTAIGDYLGERSDFNEAVLRAFVELHDFTDLILVQALRQFLWSFRLPGEAQKIDRMMESFAQRYCQLNPDIFTNADTCYVLSFAIIMLNTSLHNPSVKDKPSPEQFVAMNRGINNGGDLPQELLLSLYESIKTEPFKIPEDDGNDLMHTFFNPDKEGWLWKQGGRYKSWKRRWFILNDNCLYYFEYTTDKEPRGIIPLENISVRAASDRQRPHCLELYASGGADLIKACKTDSEGKVVEGKHTVYRMSAATAEERDEWIECLRRSISHNPFYDMLAQRKKKAQHNLHSGSH-