Monarch geneset OGS2.0

DPOGS210173
TranscriptDPOGS210173-TA1686 bp
ProteinDPOGS210173-PA561 aa
Genomic positionDPSCF300393 - 94442-100238
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0127530.084.31% 
BombyxBGIBMGA014142-TA6e-14365.69% 
DrosophilaCG11396-PA3e-8441.90% 
EBI UniRef50UniRef50_E2BS493e-9441.42%Tetratricopeptide repeat protein 15 n=8 Tax=Formicidae RepID=E2BS49_HARSA
NCBI RefSeqXP_973508.13e-11842.63%PREDICTED: similar to d-alanyl-d-alanine carboxypeptidase [Tribolium castaneum]
NCBI nr blastpgi|910865856e-11742.63%PREDICTED: similar to d-alanyl-d-alanine carboxypeptidase [Tribolium castaneum]
NCBI nr blastxgi|910865851e-11342.63%PREDICTED: similar to d-alanyl-d-alanine carboxypeptidase [Tribolium castaneum]
Group
Gene OntologyGO:00054887.6e-14binding
GO:00301265.1e-08COPI vesicle coat
GO:00068905.1e-08retrograde vesicle-mediated transport, Golgi to ER
GO:00051985.1e-08structural molecule activity
KEGG pathway 
InterPro domain[367-526] IPR0119907.6e-14Tetratricopeptide-like helical
[457-542] IPR0068225.1e-08Coatomer, epsilon subunit
Orthology groupMCL14239 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210173-TA
ATGGACAATAAGCCAACCTTGAGCCAATACTTTGGTGGCTCAGAAATTCCACCAGCCTCACAGTTTTTTGATGAAATAGGAACATCTCCTTCAGAAATGATTCAAAGTATTTATCTTGGGGAATCAGAAGCTGGCATAAGTGCACCCAGAGCTTTCACAAGTCCTCCTGACCCTCAAGAAATATTCACTAATGTAATACCCACTGTGAGTTCGGCAACTGCAGTCCCGGCTACCAGCTTACCAGATCCATCAACATTCTTTGATACAATTGGACCCGAACCTCAAATTGGTTCAAGTAAGAGCAATCTTATTTCTGCTGCTGTCATTACCGAGGCCATGGACGGACTAAGTGTTAAGAATCAACCAGTTGATAAGGAAGGAGATAGAAGAAGAGATGCTTGGATACCTAATGAAGAGGCAAAAAAAACCTTGCTAAAGGCTCAGTCTTCTCCAAAAGGATCGTTTTTTCCCGAAAGGGAAGTCCTCACTATGCCTGGGTTAGTCCTAGAAGAGGAATTGGCTGATGCACTTCAAGAGATAGCTGTGAAGTATCTTGGTGTGTCATCAGCTGGTAGCCGTGGTGTGGTCCGAGCGGAGCATGTGAGTCGGGACGAGGCAGGGCTCCGGGAGCTTCTACGGACAGGCTACTTGAGGGCTGCGGTAAATCTAACTGCTACATTACTGAGTGCTGCTGGTCAGGGCGCTGGTCGTATGCACAGACCGACGAAGCACAGTCAACGCTCACTACAGCTGTGGCTGACACGGTTCGCTGTCATGTGTCGCATCAAGTTGTACGAACCTTTACTGAAAGAGGCCGAACCGTTTGGAGATTTCACTAAACCGGATATGTTTTATGAGTTTTATCCGGAGGCTTACGAGAATCGAACCGGCTCGTTGGTACCGTTCTCGTTGCGTCTCCTCGTCGCTGAGCTACCGGGACACGTCGGCAAACCCGAAGAGGCCATGGATAGATTATACGCAATGCTAGATGTTATTGAACAGATGATATCAAACCTTAAATCCGGTAAGACGGAGGTTGGCACAGATAATATATCAGCTGAAGATCAAAAAGAATCACTGCGACTGTGGAACGGCAGGCGGATACGAGTTTTGCATTCAATAACAAACTGCGCTATAGCTCTCAAGGATTACCGTCTAGCGACGAAGATTCTGACAACTCTTAAAAATGAAGCGACTAACGTACAACAGCAGCGAGCGCTGCACAGCGCTTTATGCCGCGTAGCGCTGCTAGCTGGACACGGACGCGCCGCCGTAGCACACTGTAGCAATGCGAAGGACGCCAGAAATCATATCTGCCCAACTCCAGATGTAAGGGAGTATGTCGACTTGGGCTTAATAGACATAGCGCACGGCAAGTATCAAGACGCCTACAATAACTTTGCGAGAGCAGCTGATCAAGAACCTACTAATATTATGGTAGCTAACAATTTGGCTGTGTGTCTCTTATACATGGGTCGTTTGAAAGAAGCTATATCCGTTCTCCAGAAGGCCATACACTCGGATCCTGAGCGAGGTCTGAATGAAAGTCTTCTCATAAATCTGTGCACTCTCTACGAACTCGAGTCGTCAAAGACAAATGAAAAGAAACTTAACTTGCTGAGAATGCTTTGTAAACATAAAAGCGATACTATACCTAATGTATTGGAATGTCTGAAACTTGCTTAG

Protein sequence:

>DPOGS210173-PA
MDNKPTLSQYFGGSEIPPASQFFDEIGTSPSEMIQSIYLGESEAGISAPRAFTSPPDPQEIFTNVIPTVSSATAVPATSLPDPSTFFDTIGPEPQIGSSKSNLISAAVITEAMDGLSVKNQPVDKEGDRRRDAWIPNEEAKKTLLKAQSSPKGSFFPEREVLTMPGLVLEEELADALQEIAVKYLGVSSAGSRGVVRAEHVSRDEAGLRELLRTGYLRAAVNLTATLLSAAGQGAGRMHRPTKHSQRSLQLWLTRFAVMCRIKLYEPLLKEAEPFGDFTKPDMFYEFYPEAYENRTGSLVPFSLRLLVAELPGHVGKPEEAMDRLYAMLDVIEQMISNLKSGKTEVGTDNISAEDQKESLRLWNGRRIRVLHSITNCAIALKDYRLATKILTTLKNEATNVQQQRALHSALCRVALLAGHGRAAVAHCSNAKDARNHICPTPDVREYVDLGLIDIAHGKYQDAYNNFARAADQEPTNIMVANNLAVCLLYMGRLKEAISVLQKAIHSDPERGLNESLLINLCTLYELESSKTNEKKLNLLRMLCKHKSDTIPNVLECLKLA-