Monarch geneset OGS2.0

DPOGS204158
TranscriptDPOGS204158-TA6831 bp
ProteinDPOGS204158-PA2276 aa
Genomic positionDPSCF300034 - 569383-583228
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0047390.077.33% 
BombyxBGIBMGA005041-TA0.069.67% 
DrosophilaCG5270-PB2e-10526.13% 
EBI UniRef50UniRef50_D6WSS32e-13627.61%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WSS3_TRICA
NCBI RefSeqXP_973824.23e-13627.54%PREDICTED: similar to CG5270 CG5270-PA [Tribolium castaneum]
NCBI nr blastpgi|2700102056e-13627.61%hypothetical protein TcasGA2_TC009577 [Tribolium castaneum]
NCBI nr blastxgi|2700102057e-16027.25%hypothetical protein TcasGA2_TC009577 [Tribolium castaneum]
Group
Gene OntologyGO:00468724.9e-18metal ion binding
KEGG pathwaymdo:1000152211e-09 
 K12478 (EEA1)maps-> Endocytosis
    Phagosome
InterPro domain[1448-1515] IPR0110111.7e-18Zinc finger, FYVE/PHD-type
[1441-1511] IPR0003064.9e-18Zinc finger, FYVE-type
[1442-1511] IPR0130832.1e-16Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL12455 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204158-TA
ATGAAGAAAGAGGAGCTGCAGTACACAATTCGTCTATTAGATAATCTGACAGTGGAATTGTTTGAGGAAATCGTTGAAGCATTGTATAACACAGAGGATTCTAAACCAACCACCGAATTATCTGAAAGAGTTATTGAATTTTTTAGAAATAATTTAAAATCCAATCCAGTCACAACATCATTGATTCTAAATCACATTCAAAGCACCACAGACGATGACAAAAATGTTTGCGCAATTCAGAAACTCTACTTAGAAGCTTTGAATAATGAACTAGGAAATAACTGTCCAGATCGAGTTATAACAATATTGTCATTAGCAAAGTGGCAAGAAAATTTTTTACCTCAGTTGATAAATGAAGTTTTGAAACACTTTTCTAACGATGCATTCAGACAATGTAAGAAACAGGTGCTGATTAGTCTTGTCGCTCTTCGGAGACCAGAACTTGTCAATATTGTATCAGATTTGATGCATAGGAATGAGCCCCAAAAAAATTTTAAACTGACAGCTGAATATATGATCGAGTTGTGCTATCAAGAAAAAACACATTGGCTGTTAAAAGCTGCTCAAGTTTATAAGCAACAGCTGCATGATATGAAGAATGTGACCTTTGAAATAAATAATTTTACTAAGCAAATACTCATTATGGTGCTTATTGATCTCACGAATTGCCCCGAGACGTACGACCTGCCGTCATTGGTCAACCAGCTTACTAATATATTTTCTATACTGGATACATACACTACAACTGATCGTGAACTCATGTTTTATTTGACAGAGATTAAGAGAGAATTAACCATCATCAAATTCTGTCTTGAAAATAATTGTAAGATAATGAGCACTTCAATATTCAGTCAGGTTTTGGGGAAAAGTGCCCTAACTGTGATATCGCAAGTCTGCAGGCTGTCGAAGGTGGATAGATATAAAGTATCGAAGCTTTTAAATACAAACAATGAATTCATATCCGAACTAATATCATTGAATCACAAAGGCGGTTTTAATAAAAACATACAAATATCTAATTGGGATATTTCAACCTATAATAGTTACTGTGCTATAACGAAGGTAATAGAAACTATATTGCAGTCATCTGAACACGTGGACAATACGCAGGAGATGGGTACATCTCTGGACGACATCAAACGGCTCGTCTTGTCTATCCAGCCTATACAATTCTGCGTTGAAATTATAGAAACTATATTCGCTTGTCTGTTCCTGAGATACGAATATTTTTCATGCTATGAACACAATCAGGATTTCCCATGTGGGACTCATTCCGAATGTTCCTATTTTTATTCTAAAAAGCAAAACAAAAGTCCAAAGACCACAAACAGTTCCGACAGGGGATTCATTTGTAATGCGTTCACAACACAAATTATACTGAACACACTCAAACAGTGCTTCGAGAGTTTGGATGAGAGGTGGGATGTAAGAAAAATCGACAACGCCTCGTCTGTTTACTCTCGGTACAAACGACTGATAGATGTAGTTAACCATTCACTATGGAAGATGCAATTGGTGTCGACTCTCAGTCCTGACACCAGTCCACTTCAGCTGAAATTGTGCCTGGATTACGTAGAAGTTGACAACAGTAGTGAAGAAGACGACTTGCCGGTTGAACAGAAAATATCTAGGAGGAAGCCCAAAATGCGGAGAAGACATACAAACCCAAGATCTGAAAACGGTCAGATTATGCTGTCATCTACTATTTCGGCTTCTGAAGATGCTTACACATCGAAGAAGAACACGATAGACTTCATATCTATTATGCTCGCCAAGCCCAATTGTCTCGTCGCATTGGCCCTCTACCAAAACGATTTCTCCAAAGCGAACCAAATTTTGGAGATGTTTGATTTATCTGATTCGGAGATTGCTACGGAAGTTACGTTTACCGAGCAGCTGAGACATTTAATAGCTAAAATAAAAAATATTATATGCTATAGAGATTGCTCGCGATATCCAACAAAAGAATTGTCAGCTCTGTCAGTAGTATCCACAGAGGTTATGGGTTTGGTTGAAGGTTTCTTGGCTTCTAATAGACCTCCAAATTCTTTTGACATCATTGAACTCGGTGCTCGACATCCGCATTTGCAACTGTACAGCCACCAAAACGCTCCCTTTATTAACGCCGTCGATATTCTTTTGAGCAGTGGCTTAAATAGAGAAATTACCTATGGTCTCATCAATGTGGTAGAAAGAAAACTCGCGGAGGTCAAACGTAATACCGACGTCACAGCCGTTAAGAAAAGCAATTATATAAAATTTGTGGAGGACATAACAAGTGTAACATTTGAAATTTTCGGCAATACAGATAAATCTTTTGATTTCATAGACAATGTTTGCGAATTAATAACAGATTGCAGAGTACCTATTAAACACAAAGAGTTCTGTAAGCTGAATCAGTTATCCAAAGAATTGAAGCAGTGCTGTTTAGATTTGTCGGATATTGTTTGTCCAAAAGAATCCAAGAAATTAGATGAAAATTTTCTCCAAAAATATCATCAAATCTATATGAATGTGGTTGCGGCGTCTGATAAAGACTTGTATAATATTGGGGACTTATGTCAAAATGGAAACAAGAAAGCAAGAGTGCATTATTTACGAAAATGTTATATGTACACAAAATCCATTTTTCAACTTGAGCAAGATGACCGGGTTGGTGAAGCAGCTATAGATGCACCATATTTTACGATTTTAGATCGACAGTTACATGACATATTTGGAAGTATGATAAATAGTAAAGAAATACCGATAACGTTTTTGGAGCCGATATGTAAAAAATTAAATGTGAATTTAATTCCTAAACTCATTCTTAATTTTTGTCCTTCAATAACATTGGCCGGCTCCTTGAAGGAAGCTACAGAAGAAAACTTCAATAACCTCCTTCATGTAGTGTACGACTTCATGAATCCTGAAGAGAAGTTATTTGTTGAACCGACGGCTAATTTCGCTCCTCTACCAAAAACTCCTGATCCTCAATGCCTTACTTACGTCGTCTTACATAATTGGGTGCTGGCACATATCATAAGAAAAATACATTCTACGTCAAATGAAAGCACTGCACAGAATAGTATGGACGCTCGTACCAAACACCTCAATAAATACATGGATCTAGAGAGATTCGATTGCACGAAGGTTTTGTTTGGTAGAAACAAATGTCTTGCTTCCCTGCACAGTACCATTGATTTGGACAAGTTATTTTTATTCCTTCCAAAAATGTTAGAAGCCGGAAAAATATGGCAGTGTCTGAAAATTATTGACGCATTATCAGAAAGACAAGTAAAGGGCAGCGCGAATTTGTTAAACTTACGCGATTTGATGTTGTGTAAAATTTCCACAAATTCTAAAGTGCAGCAAAATTGGAAGTATTGTCAGTATCTAAAATGTCCGGAACTTAAATTAGATTTAATATTAAACAACTTGTCGTGTTGGAACGCTGAAGGAGCTATGGAAGTTATTGATTATCTACGATTTTTATTGGATCAGACAGCGATCGAAGAAGGTCTCTATGACAAATGCACCGATTGGCTGATAAAAATACCTCTTTATGAACAGATTGCATCCATTATGGGGTCTCACCATTGGTATACAATTTATGAGAAATCAGTGGAAAACCCCGAAGCTATTATCGAAGTCCTGATGGACAGCCAACAGTTCAAATTATGTTTGGAATGGGCAGATGTTCACGATGTTTCAGATTATATGAAGAATTTGATAGTCGTCCACCTTATTAAACAATTATTTGAATACACTAACCAAGCGACGCCATCTTATGTTAGAGAACTAATTGAAAGGCTACCAACAACACAAGCTATAGAGTTAGTGATAGAAGAAATAACTAAAATAAGAAACATTGAAATTCTAGAAGTTTGCTACGACTTTTTAAACGAACAAAAAACTACGTGGAGATGTTTCGAAAACATACGAGTCGGTTTTCAAGTGATTCGTGAAATAGAATCTAACACGCGACATCTGTTCTGGGATCTGTTAGAAAAACCTCTGCTAATGATTGAACAACTCCTGATGAACGCCAAATTGGAGTTACTGACTGTTATCATAAACAAAATATCGCCAACCCTGCGAAGGAACGATTCTACTGGCGATAATTTGTATTATAATATAAAATCAATAGAATCGGTCTTGATATCCCAGAACGCTGTCGATGCGCTGCTGCGTTATTACGCAGAAAAAGCATTAGACATGAGGAATCCTAGAAATAGATCTTCATCACCACCAAAACCTTGGGACGACTCGTTGTTGCAATCGATAGACTCGATTAATATCGAATCTGCATCGAAACCTTTCGTGATGCCCGAACACGTCCCTAACAAGAAACAATGGGTGGAAGATTATTCCACGGACCGCTGTATGATGTGCAAAATATCGATATTTTCAATGATCATACGTAGACATCACTGTCGTCGATGTGGCCGGCTGGTTTGTCACGGATGCTCCAGAAATAGAATGCAGGTGCCAACGTATCCTAGTGGCGTAAAGTTGCGTGTCTGCGACGATTGCTACACTCAGACAATGAACAAGAGGTCCGAGTCCAATCAGGACATGATGCTAAGTAGTAACTCTGATACCACCGGCAGTGGCACCACTTGCCTGGATTGGTGCCTGTCCGTCGACGCTGCCAAAAACGAGGCCGTCAGGGCCGAATTCAGTTACGAGTTTACACCGAACGTGACGCTTTGCTTATCAATTATGAAGATGCACACCATCAACTTGGACTATCCCAGGTTTTTACTAGATCGTAGTGACGAGGCCGCTCGTTCTCTATCCCGCGGTGATTCTCGACTGTTAGTTCGAGCAAGACGATCTTTATTATTGGCAGCTGCCGAGTTGTACTCCAGGACTACTGGGGACAGGAGTTCTAGCGGCGGTGTCTGCGAGGCGGGGGCTTGTCACGCAGCTCGTTGCCTGGCTCACGCGGACGCTATGGCGGCGCTAGTGTCGCAGCAGGCTCACCACCTCGTCACCAATAACGCGGCACATCCCAGTCAAATAGTCCGCTCGCTTCTCGAGTCAGAGAAATGGGAATTGGCGTTAGAAATAGCGACGAAATCCGGAATACCAAGGACAAGTGTGTTGGCATCCTGGGGAAAAGCGTGCTTGAAGGCAGGATGCTTTAAAGAGGCTCGCAGAAAATTCGCCCTCTGTTTTAAGAACGCTCCAAATGTTTTCGCTGACGTCACTGAGGAGTTAGAGTATGGGAAGGAGGCGTCGGAAAATCTTATTAACAGAAGGTTTCAAAGTTTGAGAATCTCTGAGAGGTCCAATTCAATGTCCAGTACTCAGTCTGAATACAGGTGCCGGAGTAACCCGTTGCTGAACGAAATAATATCTATGTTAGAGGATATGAACTACCCAGTCAACCAGCAGCTGTTGGATAAGGCGGAAAATATTAAAACGACCAACGAAAAGTTGTCCAACATGAACACTAGGAAGAAGAAGATACCACTCGCAGAACCGGCATTGAATATAATGCATACGCTAGCGAGTGTCAAGAAAATTAAACAAGGCGACTATAGCGACTTCCAGACAGCCACAGTCCAACCAAAAAAATCATTGGCTCAAGGACTTCTCCGCAGAAATAATAACCCCGAACCGAAAGTCGACTACCAAAACAAGAAACTTGATCCGTTCTTCTACAAGGAGTGCGTATACTACCTTAGCAGCTACGGCTCGTTTGTGGACAATATAATGTTTTATATGAAACATAGCAATATGAGCGAGGTTATCCGCTACTGCTACGACAACTCAGTTGACAAAGAAACCTTCACTGAGTCCGTGTATATGAACTGTTTGAAGAAGAACAAGGTCGATGAACTCGTTAAATCTATGAAGGAAATGGACAGCAGTTTCAATATGTGGTCGGAGTATATAATGCACATCTGCCGCACGCTGGAGATGTCGAAGCGTCTAGAAGCCCTGTACGCACTGCAGTGCGGTACCGGCCAGCACGCTCGAGCCTCCGCTTCGTGCGCGTTACTGTACTCCCGCCCGCTACCAGCTGGCCGAGACCCCTTCAGTGAACTGGTGACCCGCCAGCACCACCTCACGGCGGCCATGCATCATTTGAACCAGTGCATTCCTGTGAACAAGGTGAATGATCAGAAATCGATTCATTTCCACCTCGACAAAGCTACTATAGACAACTTAATGAGCACTATATCCCGGCAACTGGAAGTCGCTAAGTATCTAGCGGCCTGCGAGGCCAGCGGGAGTTTGAACAATAAAATTATAAACGCTGTGATACCACTCCAGAACCTACGAAGAAATGATTCCGATCTGAGACCCTTGACTCTATTCGGTTCGAATACAGATAAAATAAGGATTGTTGCTATTGTGCTCGTCTCGGGTCAGACTGTGGAGGTTGGATTCGATTTAGCTTTTAAAATAATCTGCGAACACAAATTAGACTCGATGAACATATACACGCACGTCGCCAAGTATTTAGTGAACGCGGACAGGTTCATGGAAGTGAAGAATTTGGCAAAATGTATACGGACCTCTAAGGAGACGGCTGCTAGCCTGATGAGTGATCAGGTGTTGGAGGCTGGTACGACTGCTGTCGTCGGCAGGTGTGAGGCCCGAGGACAGTTACATGACGAACAAGCCGAGTTGCTCATAGCGGATATCAACAGTGTCGCTATCAAGATTTCATGCTATTTGGTATGTCATAACGTCAGCAGTGCGTACATCTTAGCAGCTAGACACGATAGAATTAACGATTTGCGAAAAGTCTTGCAAGAAGCAGAAAGATTATGTAACGAACAAGTAAGGAATGCTTGTCTCAAACGACTCACATCTAAGAATATTCTCACTTAG

Protein sequence:

>DPOGS204158-PA
MKKEELQYTIRLLDNLTVELFEEIVEALYNTEDSKPTTELSERVIEFFRNNLKSNPVTTSLILNHIQSTTDDDKNVCAIQKLYLEALNNELGNNCPDRVITILSLAKWQENFLPQLINEVLKHFSNDAFRQCKKQVLISLVALRRPELVNIVSDLMHRNEPQKNFKLTAEYMIELCYQEKTHWLLKAAQVYKQQLHDMKNVTFEINNFTKQILIMVLIDLTNCPETYDLPSLVNQLTNIFSILDTYTTTDRELMFYLTEIKRELTIIKFCLENNCKIMSTSIFSQVLGKSALTVISQVCRLSKVDRYKVSKLLNTNNEFISELISLNHKGGFNKNIQISNWDISTYNSYCAITKVIETILQSSEHVDNTQEMGTSLDDIKRLVLSIQPIQFCVEIIETIFACLFLRYEYFSCYEHNQDFPCGTHSECSYFYSKKQNKSPKTTNSSDRGFICNAFTTQIILNTLKQCFESLDERWDVRKIDNASSVYSRYKRLIDVVNHSLWKMQLVSTLSPDTSPLQLKLCLDYVEVDNSSEEDDLPVEQKISRRKPKMRRRHTNPRSENGQIMLSSTISASEDAYTSKKNTIDFISIMLAKPNCLVALALYQNDFSKANQILEMFDLSDSEIATEVTFTEQLRHLIAKIKNIICYRDCSRYPTKELSALSVVSTEVMGLVEGFLASNRPPNSFDIIELGARHPHLQLYSHQNAPFINAVDILLSSGLNREITYGLINVVERKLAEVKRNTDVTAVKKSNYIKFVEDITSVTFEIFGNTDKSFDFIDNVCELITDCRVPIKHKEFCKLNQLSKELKQCCLDLSDIVCPKESKKLDENFLQKYHQIYMNVVAASDKDLYNIGDLCQNGNKKARVHYLRKCYMYTKSIFQLEQDDRVGEAAIDAPYFTILDRQLHDIFGSMINSKEIPITFLEPICKKLNVNLIPKLILNFCPSITLAGSLKEATEENFNNLLHVVYDFMNPEEKLFVEPTANFAPLPKTPDPQCLTYVVLHNWVLAHIIRKIHSTSNESTAQNSMDARTKHLNKYMDLERFDCTKVLFGRNKCLASLHSTIDLDKLFLFLPKMLEAGKIWQCLKIIDALSERQVKGSANLLNLRDLMLCKISTNSKVQQNWKYCQYLKCPELKLDLILNNLSCWNAEGAMEVIDYLRFLLDQTAIEEGLYDKCTDWLIKIPLYEQIASIMGSHHWYTIYEKSVENPEAIIEVLMDSQQFKLCLEWADVHDVSDYMKNLIVVHLIKQLFEYTNQATPSYVRELIERLPTTQAIELVIEEITKIRNIEILEVCYDFLNEQKTTWRCFENIRVGFQVIREIESNTRHLFWDLLEKPLLMIEQLLMNAKLELLTVIINKISPTLRRNDSTGDNLYYNIKSIESVLISQNAVDALLRYYAEKALDMRNPRNRSSSPPKPWDDSLLQSIDSINIESASKPFVMPEHVPNKKQWVEDYSTDRCMMCKISIFSMIIRRHHCRRCGRLVCHGCSRNRMQVPTYPSGVKLRVCDDCYTQTMNKRSESNQDMMLSSNSDTTGSGTTCLDWCLSVDAAKNEAVRAEFSYEFTPNVTLCLSIMKMHTINLDYPRFLLDRSDEAARSLSRGDSRLLVRARRSLLLAAAELYSRTTGDRSSSGGVCEAGACHAARCLAHADAMAALVSQQAHHLVTNNAAHPSQIVRSLLESEKWELALEIATKSGIPRTSVLASWGKACLKAGCFKEARRKFALCFKNAPNVFADVTEELEYGKEASENLINRRFQSLRISERSNSMSSTQSEYRCRSNPLLNEIISMLEDMNYPVNQQLLDKAENIKTTNEKLSNMNTRKKKIPLAEPALNIMHTLASVKKIKQGDYSDFQTATVQPKKSLAQGLLRRNNNPEPKVDYQNKKLDPFFYKECVYYLSSYGSFVDNIMFYMKHSNMSEVIRYCYDNSVDKETFTESVYMNCLKKNKVDELVKSMKEMDSSFNMWSEYIMHICRTLEMSKRLEALYALQCGTGQHARASASCALLYSRPLPAGRDPFSELVTRQHHLTAAMHHLNQCIPVNKVNDQKSIHFHLDKATIDNLMSTISRQLEVAKYLAACEASGSLNNKIINAVIPLQNLRRNDSDLRPLTLFGSNTDKIRIVAIVLVSGQTVEVGFDLAFKIICEHKLDSMNIYTHVAKYLVNADRFMEVKNLAKCIRTSKETAASLMSDQVLEAGTTAVVGRCEARGQLHDEQAELLIADINSVAIKISCYLVCHNVSSAYILAARHDRINDLRKVLQEAERLCNEQVRNACLKRLTSKNILT-