Monarch geneset OGS2.0

DPOGS209643
TranscriptDPOGS209643-TA1317 bp
ProteinDPOGS209643-PA438 aa
Genomic positionDPSCF300015 + 1118618-1122506
RNAseq coverage207x (Rank: top 46%)
Annotation
HeliconiusHMEL0170503e-15262.31% 
BombyxBGIBMGA006706-TA0.075.78% 
DrosophilaRbsn-5-PA1e-9540.78% 
EBI UniRef50UniRef50_Q29MW05e-9440.92%GA21126 n=4 Tax=Schizophora RepID=Q29MW0_DROPS
NCBI RefSeqXP_001989050.13e-9942.23%GH10254 [Drosophila grimshawi]
NCBI nr blastpgi|1950351536e-9842.23%GH10254 [Drosophila grimshawi]
NCBI nr blastxgi|1951143263e-9742.24%GI15463 [Drosophila mojavensis]
Group
Gene OntologyGO:00468723.9e-19metal ion binding
KEGG pathwaydgr:Dgri_GH102549e-99 
 K12481 (ZFYVE20)maps-> Endocytosis
InterPro domain[144-220] IPR0003063.9e-19Zinc finger, FYVE-type
[143-244] IPR0110115.2e-19Zinc finger, FYVE/PHD-type
[139-221] IPR0130831.1e-15Zinc finger, RING/FYVE/PHD-type
[393-432] IPR0215655.5e-13FYVE-finger-containing Rab5 effector protein rabenosyn-5
Orthology groupMCL13663 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209643-TA
ATGGCAGCATCAAATGAGGAAGAAATTTTGGAGGGTTTTCTGTGTCCAATTTGCAAAGCAGATTTAAAATCAGCTAGCCAGCTTACCAATCATTTTGAAAGCTTGCACCAAGAAGAGCAGGACGTTCTCAAATCATTGAAGGAAATATTTGGTAAAGCAAAGAAGATAATTTTAAACAATGACGATACAGATTTGAAGGAAACCTTTGATCGCGCCTTGAAGTTCAGTTCCCAAGAACAATATTATGCTAGGGAGGAGCAGACAGTTGGAGTGTCGCGAAGTTCCACTGATTATTTCAAGGCTGTTCGTTCAGCTAGATTAGAGAGATATGCCACGGAAACTAATAAGTTGTTAATAAGATTAGATAAACTGGTATGTAATATGCCAAGTGATCCAAACCAAAGGAAACAGCATGAACAGGAAGTTGTTCCTTGGTTGGATGGGTCGTCTGTGAAGTTATGTCCAAACTGTGCTAAAGCTTTCAACCTGACGAGGCGGAAGCATCATTGTAGACTCTGTGGTTCAATTCTATGCCATGACTGTTCAGTGTTCTTAGATTTAAATGTTGCTAGTAAATATCCGTCTGCACCTCCTCAACCAGAAACATCAGCGGTCGAGAAGTTTGGTCTCCGTCTCTGTGAACACTGCTACAACCTGATTCAACTCCGACGGCAGATTCAAGAGAACAGGAATGTGAAGACTGTGTTAATGTCAGCCTACGAACAGATGAGGAGCTTGATGGAACAAGCTACGCCAGCTGTTGAAATGTATGAGAAGATGTGTCAGAGTTTGTTTGACGGGGAGACTATATACAGTCTGTCCGATGTGAACGCTATGCGCGGTCGTATAGGGAAGCTGGCTGAGGGTATCGATCTGTTGAGCAAGCACATAGCGAGCCTGCCGGTACAGCCGGGGACGAGACAAGCCAAGCTCCAGAACTCTATCAGACAGGCCTCCGCACACTATATCAAGGAGGAGTTGGTCTCGCTACGGAAATTGCCAACAGAGGCTCAAATAGAAGAAGTTAGGAGACACCGGTACGAGCGGGCCCAGAAGCAGATAGAGTTAGAAAGAGAACGGATAGAGAGGGAAAGAGAAAGGAGAGAGAGGGAGTGGGGGGAGGAGGGGGAGACGAGCGGGGGCAGGGTCCAGCAGCACGACGATGATAACCCGATCCTAGAGCAGATGAACATTATAAGAGATTACATCAAAGAGGCCAGGAGAGAACTGAGGTTTGAAGAAGTGGCAATACTAGAGCAGAACCTCAAGGATCTGAAGAAGGAATATCAACTTCAGATGCTCTCAAACAAGTCCTAG

Protein sequence:

>DPOGS209643-PA
MAASNEEEILEGFLCPICKADLKSASQLTNHFESLHQEEQDVLKSLKEIFGKAKKIILNNDDTDLKETFDRALKFSSQEQYYAREEQTVGVSRSSTDYFKAVRSARLERYATETNKLLIRLDKLVCNMPSDPNQRKQHEQEVVPWLDGSSVKLCPNCAKAFNLTRRKHHCRLCGSILCHDCSVFLDLNVASKYPSAPPQPETSAVEKFGLRLCEHCYNLIQLRRQIQENRNVKTVLMSAYEQMRSLMEQATPAVEMYEKMCQSLFDGETIYSLSDVNAMRGRIGKLAEGIDLLSKHIASLPVQPGTRQAKLQNSIRQASAHYIKEELVSLRKLPTEAQIEEVRRHRYERAQKQIELERERIERERERREREWGEEGETSGGRVQQHDDDNPILEQMNIIRDYIKEARRELRFEEVAILEQNLKDLKKEYQLQMLSNKS-