Monarch geneset OGS2.0

DPOGS209384
TranscriptDPOGS209384-TA2523 bp
ProteinDPOGS209384-PA840 aa
Genomic positionDPSCF300118 + 257015-260176
RNAseq coverage471x (Rank: top 26%)
Annotation
HeliconiusHMEL0131130.087.02% 
BombyxBGIBMGA005526-TA0.077.74% 
Drosophilalt-PA0.044.86% 
EBI UniRef50UniRef50_UPI00017932780.047.82%UPI0001793278 related cluster n=1 Tax=unknown RepID=UPI0001793278
NCBI RefSeqXP_973665.10.053.99%PREDICTED: similar to light protein [Tribolium castaneum]
NCBI nr blastpgi|910845650.053.99%PREDICTED: similar to light protein [Tribolium castaneum]
NCBI nr blastxgi|910845650.053.99%PREDICTED: similar to light protein [Tribolium castaneum]
Group
Gene OntologyGO:00068865.2e-19intracellular protein transport
GO:00161925.2e-19vesicle-mediated transport
GO:00054885.6e-18binding
GO:00055151.8e-17protein binding
KEGG pathway 
InterPro domain[573-712] IPR0005475.2e-19Clathrin, heavy chain/VPS, 7-fold repeat
[507-764] IPR0119905.6e-18Tetratricopeptide-like helical
[33-226] IPR0110461.8e-17WD40 repeat-like-containing domain
[34-198] IPR0159433.4e-12WD40/YVTN repeat-like-containing domain
Orthology groupMCL13552 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209384-TA
ATGGCTTTAAATCTAGAGAGCTGTGATAGTCTGCCACCAGACGAGGAGCCTAAACTGAAATATGATCGAATGGGCAATGACGTCGAAAATATCTTATTAAAAGATGCTGTGAGCTGCATCTGTGTGCATACCAAGTTTATTTGCTTAGGAACTCAGTGGGGAGTGATACATTTACTGGATCATGATGGAAATACAGTGCCCATCTCACCAGACAATAATCAGAAAGATCTGCAAGCTCATGCTATAGCTATAAACAAAATATCTGTAGACCTAAATGGCGACTACATCGCTAGCTGCTCTGACGATGGCAAAGTGGTAGTGTACGGCCTCTACTCACCAGACAATACTCACAATCTCACATTAGGGAGAGTTGTTAAATCAGTCTCCTTAGATCCTTATTATTTTAAATCTGGGTCAGGACGAAGGTTCCTCACAGGTGATAATAAACTCACCCTGTACGAGAAGACTTTCTTGAATCGGTTGCGAAGTACCGTTTTGTGTGAGTGTGAAGGTTATGTCCAGGCAATAGCGTGGCATGAGCGATTTGTGGCATGGGCCAGTGAGTCTGGTGTGAGAGTGTATGACCTCTCTGCTAGATGCTCTCTGGGTCTCATACAGTGGGAACGGAACCCTAATAGGTCCATCGAAGACTTTCGCTGTAACCTTCTCTGGTCAGCCCCCAAGACACTCATGATTGGCTGGGTGGACACTATCAGAATATGTGTCATAAGGAAAAGGAGTCACATAGAGTTACAAACGCGTGATGTCACAGAATATTTAGTTGATCCAGTGCATACATTTCAAGTGGATTATTTCATAAGCGGCCTGGGACCTCTTGATGATCAGCTGGTGTTGTTGGGGGTACCTAAGGAATGTGATCCCGAAACTGGAAAGGCCCAGAGACCGGTACTGGCTGTAGCCGACTATAAGGATTGTGAATTCTGTGAAGTATCCAACGACAGTTTAAACATTATTGGCTTTCAAGAATATTCTTGTAACGACTACTATTTAGACATGCTCATTGAAGAAAATAGATTCTTCATAGTGTCGCCGAAGGAAATAGTTATAGCGAGTCCCTACGACATCGACGACAGAGTCAACTGGCTAACGGCGCACGAGAGATTCGAGAAGGCGATATCAGTTCTGGAAGAGAACGGCGGTAAAACATCCAAGCATTCCATAGTGACTGTGGGAGTGCAGTATCTAGACCACCTGCTGGCCGAACGCCTGTTCGATGAAGCGGCGGTCTTGTGTGCCAGAATATGTAAAAATGACAAAGTGTTGTGGGAAAATCAAATATTTAAGTTCTCTAAGATGAATCAATTGCGAGCCATCAGCCCCTACGTGCCCAGAAACCCTGGTCAAGCGCTGAGTCCGCATATTTACGAACTCATATTTTTGGAATACCTAAAGGAGGACCCTCAAGGTTTCCTCAGGCTAGTTCAGGAATGGAACCCCGCTCTGTATAAAACCGGAGTCATCATAAAAGCAGTATTAGATTATCTTTTAACGACGGAGGTCGAGAAGAACATTTATTTAGAAGCTCTGGCTCTCCTCTACTGCTACCAGAAGAAATACGACAAAGCACTCACCGCGTACTTACGACTGCAGCACAAAGACGTCTTTAAACTGATCACTAAACACAACATGTACTCCGTCATATACGACAAGATACTGGAACTGATGTCTCTCGACTGTGATAAGGCGATCGCCATCCTCTTACAAGACAAGACCAAGGTCCCCGTCCAAGTAGTGGAGAAGCAGCTGGCCGACCACGACGAGTACTTATTCAAATACTTGGACGCGTACAGTAAAGTCGAACCCAACGGACGATACCACGGGAAGCTAGTCAGGCTGTACGCCAAATACGCCAGGGAAAAATTACTACCGTTTCTGAAATGTAGCGACAACTATCCCATACAGGAGGCCTTGGACGTCTGTCAGAGCAACGAGTTCTATCCGGAGATGGTCTTCCTGCTGGGTCGGATAGGAAACACGAGGGAGGCTCTGCAAATTATAATTGAAAAGTTAGACGACATCAACCAGGCCATAGGCTTCTGTCAGGAACACAACGACAAGGAGCTCTGGACGGATCTCATCAAGCAGACGGTCGACAAGCCGGAGTGCGTGTCGCTGTTGCTGAAGAGGATCGGCAACTACGTGGACCCCAGGATGCTCATAGAGAACATACAGCCCGGCTGCGAGATAAAAGACTTGAAGGACTCCCTGGCCAAGATGATGTGCGACTACCACTTGCAGATGTCGGTGCAGGAGGCGTGCAAGGTCATCACACTGAGGAACTACTTCGACCTCCACGAGAAACTGATCATCAACCAGCAGAGAGGCATCTCGGTGACGGACGAGTTCCTGTGCAGCGTGTGTCAGGGCAGGATCATCATCCGGGACCTGGCCAACGCCTCCAACCTCATCGTGTACAACTGCCGACATTCCTTCCACATAGAGTGCCTCCCGGACGGCGTCAACAACGCGTCCTGCAGCGTGTGCAGCGCCGTCAAGATGTGA

Protein sequence:

>DPOGS209384-PA
MALNLESCDSLPPDEEPKLKYDRMGNDVENILLKDAVSCICVHTKFICLGTQWGVIHLLDHDGNTVPISPDNNQKDLQAHAIAINKISVDLNGDYIASCSDDGKVVVYGLYSPDNTHNLTLGRVVKSVSLDPYYFKSGSGRRFLTGDNKLTLYEKTFLNRLRSTVLCECEGYVQAIAWHERFVAWASESGVRVYDLSARCSLGLIQWERNPNRSIEDFRCNLLWSAPKTLMIGWVDTIRICVIRKRSHIELQTRDVTEYLVDPVHTFQVDYFISGLGPLDDQLVLLGVPKECDPETGKAQRPVLAVADYKDCEFCEVSNDSLNIIGFQEYSCNDYYLDMLIEENRFFIVSPKEIVIASPYDIDDRVNWLTAHERFEKAISVLEENGGKTSKHSIVTVGVQYLDHLLAERLFDEAAVLCARICKNDKVLWENQIFKFSKMNQLRAISPYVPRNPGQALSPHIYELIFLEYLKEDPQGFLRLVQEWNPALYKTGVIIKAVLDYLLTTEVEKNIYLEALALLYCYQKKYDKALTAYLRLQHKDVFKLITKHNMYSVIYDKILELMSLDCDKAIAILLQDKTKVPVQVVEKQLADHDEYLFKYLDAYSKVEPNGRYHGKLVRLYAKYAREKLLPFLKCSDNYPIQEALDVCQSNEFYPEMVFLLGRIGNTREALQIIIEKLDDINQAIGFCQEHNDKELWTDLIKQTVDKPECVSLLLKRIGNYVDPRMLIENIQPGCEIKDLKDSLAKMMCDYHLQMSVQEACKVITLRNYFDLHEKLIINQQRGISVTDEFLCSVCQGRIIIRDLANASNLIVYNCRHSFHIECLPDGVNNASCSVCSAVKM-