Monarch geneset OGS2.0

DPOGS215454
Transcript: DPOGS215454-TA, 3066 bp
Protein: DPOGS215454-PA, 1021 aa
Genomic position: DPSCF300098 - 650774-660263
RNAseq coverage: 573x (Rank: top 22%)
Annotation
  Heliconius: HMEL008361, 5e-129, 58.31%
  Bombyx: BGIBMGA007486-TA, 6e-36, 59.71%
  Drosophila: CG32350-PA, 1e-39, 28.18%
  EBI UniRef50: UniRef50_E1ZWD6, 0.0, 46.93%, Vacuolar protein sorting-associated protein 11-like protein n=11 Tax=Coelomata RepID=E1ZWD6_CAMFO
  NCBI RefSeq: XP_393972.3, 0.0, 47.25%, PREDICTED: similar to Vacuolar protein sorting 11 [Apis mellifera]
  NCBI nr blastp: gi|307190490, 0.0, 46.93%, Vacuolar protein sorting-associated protein 11-like protein [Camponotus floridanus]
  NCBI nr blastx: gi|307190490, 0.0, 46.82%, Vacuolar protein sorting-associated protein 11-like protein [Camponotus floridanus]
Group
Gene Ontology: GO:0005515, 2.4e-11, protein binding
KEGG pathway:
InterPro domain: [29-229] IPR015943, 2.4e-11, WD40/YVTN repeat-like-containing domain
                 [29-190] IPR011046, 2.8e-10, WD40 repeat-like-containing domain
Orthology group: MCL14900, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
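
The genomic position in this record appears to follow a "scaffold - start-end" layout. The following is a minimal sketch for splitting such a string and computing the span of the locus. It assumes that layout and 1-based inclusive coordinates, neither of which the record states. `parse_position` is a hypothetical helper name, not part of any database tool.

```python
import re


def parse_position(text):
    """Split 'SCAFFOLD - START-END' into (scaffold, start, end).

    The layout is an assumption inferred from this one record.
    """
    match = re.fullmatch(r"\s*(\S+)\s*-\s*(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


scaffold, start, end = parse_position("DPSCF300098 - 650774-660263")

# Assumes 1-based inclusive coordinates; drop the +1 for half-open intervals.
span = end - start + 1
print(scaffold, start, end, span)  # DPSCF300098 650774 660263 9490
```

Under those assumptions the locus spans 9490 bp on the scaffold, which is longer than the 3066 bp transcript and would be consistent with a gene containing introns.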

Nucleotide sequence:

>DPOGS215454-TA
ATGGCGTTTTTAGAGTGGCGTCGTTTCACTTTTTTTGATGTTCACAACAACCTGGACAATGGAAAGATTGCTGAGTGCTTGCAGGATACCAATGTAACAGTGGCTACGAGTGGCCACAACCACGTAATACTCTGTGATGTGACAGGGTGGGCTCATCTGATATCCCGCTCCTGGGAGATAATGTCATTCAAAGCTTACGAGATGACCGTCCTATTAGCGCAACAGCTACCACACGATCCATTCTTAGTAACTATTGGAGAAGATGAGTCCGGTGTCACACCTTTGATTAAAGTGTGGGACTGGTCGAGGGTGGACCGTCATGGGAACCCTCAATGTGTTCGAACTGCCCGTGCTATGCCGTCTCATGGACACAATGTACAAACTACCGCTCTAGCTGTGCATGACAATAAGAATCTCCTAGCCGTTGGTTTCCAAGATGGTTCAGTGACCTTATATCGTGGAGAGATATCAAGACAGCGTGGAATCAAAATGAAAACATTACCAGACACCGGATCCAGTCCTATAACTGGACTGGCTTTCAAAGGTGCTGATAAGTTGTTCGTGGTGTCTCGTTCCTGTGTAATGGTGTGCTGGTTGACCAGCGACCGTAGTGTGGTTCTGGACGCCATGGGAGCAGCACCAGGGTGCTCAGTGCTAGCAAACTCACATAGACTGACGGTTGCCGCACCGGATGCTATTTACTGCTATACTACCGAGGGTCGTGGTCCATGCTATGCTCTGGAGGGGGAGAAAGTCAGGTTGAACTGGTTCCGCAGCTACCTGGTGATAGTCACCAACGCCACCGGTTCAGCAAACACACCGAAATCCCATCACATCACGATATTGGACATTCAGAACAAATTCATAGTATTCTCTAAGACGTTCGAAGAAATCGATGCCGTCCTGACAGAATGGGGATCCTTTTACATTCTCCAGAAGAATAAGGAGATGATATTTTTGGAGGAGAAGGATCTTCAATCGAAGTTACTGTTGCTCTTCAAGAAGAACCTGTACGATGTAGCCATTAGGATAGCGAGCAGCCAACACTACGACGTAGAGGGGTTGACTGAAATATACAAGAATTACGGAGACCATTTGTATAGTAAGGGTGACCTTAAAGGGGCGATAGATCAATATGTGAAGACGATAGGCTGGTTGGAAACGTCATACGTTATACGCAAATACCTCGAATCCCGCCACCTGGAACCCTTGGTGCTGTATTTGGAGGAACTGCATAAGAAGGGTTACGCCACCGAAGACCACACCACGTTGCTGCTGACGTGTTACGTGAAAATCGACCAACACGACCAACAGGGGAAATTGAAGGAATTCATCAACTCCAAGGATAAGGCCATCGACTTCGACGTAGATGTTGCTATCAAGGTCGTCCGTCAAGTGAGTGCCACAGACGCGTTGTCACTAGCTTACAACTACAAGCGTCACGACTGGTACCTGAAAATAGTGACAGAGGATAAGAAAGATTACAAACAGGCTCTGGACTATATATCGGAACTAGAGTTTGAAGACGCCGAGATGTACATGAAGAAGTACGGACACAAACTGATACAACACGTCCCCGGAGATAGCACCGAGCTGTTGAAATTACTGTGTACAGACTACAAACCTCGCAGTAAACCGTTAGTAGATGAGAGCACTTTATCCGGTAACCTGCGAGAACCCGACAGAGCTGTACCCGATGATTTCATACACATGTTCCTGAGCAATTCTGAGCGTCTCATAGACTTCCTTGAGCATATGGTGACCAAGGACACTCAATGCTCGAGTCTCGTCTACAATGCTCTAATTGAGCATTATATACACGTCTGGGCCAAATCGTCTGAAGCGGACAAGAGGATTTACGAGCAGAAAGTACTCGATATCATCAAAGACCCCGAAGCCAAATACGACAAAGATCAGACGCTCATTATTTGCCAAATGCTGGGATTCAAGAGTGGCATCCTCCAACTATACGAGGAGAAAAGACTATGGCGTGCTCAGAT
ATCTCTCCACCTCCGTACACCGGGCGGCACAGAGCGCGCGCTCGGAGTGTGTCGTCGTCGCGGAGGGAGTGCGCCGCGTTTGTGGCTGGACGTACTATGGGCACCTCCACCACCAGATTACCTTCCAGAACTGCTCAGAGTCGTGGCTGCCGAAAAACTGTTATCACCCATCCTGGTCATCGATTGCCTGGCGAGTACACCGACCTACACACTCGGAGATGTCCGCAAGTACCTGACGGACGTTTTGAAGTCTGAGGACGAAGTGATCACTAGAGAACAGGAACTGGCAGCGAAATACAAGAAGGAGATAGAAGAGATGAAGACTCAGATACACAACATACAGAACGAACCTATCACGTTCCAGAGGAGCCTGTGCGCGGCCTGCAGCAGGCCGCTCGAGTTGCCCACCGTACACTTCATGTGTCAGCACTCCTTCCACAAGGACTGTTTCGAGACGTATTCGGAGTCGGAGCGCCAGTGCGTGGCGTGTTCCCCGACGCTTCGCCCCGCGCCCGCGCCGCCCGCCGACCAGCTGCACTCACGACTACACGCAGACACCGACCCCGTATTATATGTGGTGACTGAGGCGCCGGAGCCCGTACCCTTCAATGTACCCTCAACTGTACCTTACGTACCATCCGTTGTGACTGTACCCTCCGCACCTGTCCCAACTTACGGACCGGGCGCTGAAGCGAAGCTCAGGCTGCAGGAGGGACAAAGCAAACAAGTCTATGTCCAGAACGCTTTGAAGCAAATACCTCCGAAGGGCACGGCGGTGATTCCCGTACCGGAAGGCAGGATGCGTCTCCTGGAACAGCATCAGTACAGTTCCAGCCTGGAAGCCAATATGAGCAAACTGGAACCCTTAGTCCACAGATCCCCTCAACAGTCCCCAAACACCTCCCGGACGAAACCTCCACAGAAAATATCCTCAGCGATCATCGATAGCAAAAATCCCTTCGACACATACGACGAGTCGAAGAATCCCTTCGCAGACGAAGACAACGATCCCACGAACCCCTTCGCCGAAGACGACTATGATAAAAATTTAAATCCATTCGCCTGA

Protein sequence:

>DPOGS215454-PA
MAFLEWRRFTFFDVHNNLDNGKIAECLQDTNVTVATSGHNHVILCDVTGWAHLISRSWEIMSFKAYEMTVLLAQQLPHDPFLVTIGEDESGVTPLIKVWDWSRVDRHGNPQCVRTARAMPSHGHNVQTTALAVHDNKNLLAVGFQDGSVTLYRGEISRQRGIKMKTLPDTGSSPITGLAFKGADKLFVVSRSCVMVCWLTSDRSVVLDAMGAAPGCSVLANSHRLTVAAPDAIYCYTTEGRGPCYALEGEKVRLNWFRSYLVIVTNATGSANTPKSHHITILDIQNKFIVFSKTFEEIDAVLTEWGSFYILQKNKEMIFLEEKDLQSKLLLLFKKNLYDVAIRIASSQHYDVEGLTEIYKNYGDHLYSKGDLKGAIDQYVKTIGWLETSYVIRKYLESRHLEPLVLYLEELHKKGYATEDHTTLLLTCYVKIDQHDQQGKLKEFINSKDKAIDFDVDVAIKVVRQVSATDALSLAYNYKRHDWYLKIVTEDKKDYKQALDYISELEFEDAEMYMKKYGHKLIQHVPGDSTELLKLLCTDYKPRSKPLVDESTLSGNLREPDRAVPDDFIHMFLSNSERLIDFLEHMVTKDTQCSSLVYNALIEHYIHVWAKSSEADKRIYEQKVLDIIKDPEAKYDKDQTLIICQMLGFKSGILQLYEEKRLWRAQISLHLRTPGGTERALGVCRRRGGSAPRLWLDVLWAPPPPDYLPELLRVVAAEKLLSPILVIDCLASTPTYTLGDVRKYLTDVLKSEDEVITREQELAAKYKKEIEEMKTQIHNIQNEPITFQRSLCAACSRPLELPTVHFMCQHSFHKDCFETYSESERQCVACSPTLRPAPAPPADQLHSRLHADTDPVLYVVTEAPEPVPFNVPSTVPYVPSVVTVPSAPVPTYGPGAEAKLRLQEGQSKQVYVQNALKQIPPKGTAVIPVPEGRMRLLEQHQYSSSLEANMSKLEPLVHRSPQQSPNTSRTKPPQKISSAIIDSKNPFDTYDESKNPFADEDNDPTNPFAEDDYDKNLNPFA-
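
The transcript and protein lengths listed in this record can be cross-checked against each other, and the two sequences can be compared by translating the transcript. The sketch below assumes the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code (NCBI translation table 1). `translate` and `CODON_TABLE` are illustrative names. Only the first and last codons of the transcript are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line wraps and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Length check: 3066 bp is 1022 codons, i.e. 1021 residues plus one stop codon.
transcript_bp = 3066
protein_aa = 1021
print(transcript_bp // 3 - 1 == protein_aa)  # True

# First ten and last five codons of DPOGS215454-TA, copied from the record.
print(translate("ATGGCGTTTTTAGAGTGGCGTCGTTTCACT"))  # MAFLEWRRFT
print(translate("AATCCATTCGCCTGA"))                 # NPFA*
```

The translated ends match the start (`MAFLEWRRFT`) and end (`NPFA`) of DPOGS215454-PA. The record writes the terminal stop as `-`, whereas this sketch writes it as `*`.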