Monarch geneset OGS2.0

DPOGS211794
Transcript: DPOGS211794-TA | 1215 bp
Protein: DPOGS211794-PA | 404 aa
Genomic position: DPSCF300107 + 407978-411582
RNAseq coverage: 1011x (Rank: top 13%)
Annotation
Heliconius: HMEL007937 | 2e-144 | 74.28%
Bombyx: BGIBMGA004102-TA | 3e-152 | 74.94%
Drosophila: CG5482-PA | 4e-56 | 35.85%
EBI UniRef50: UniRef50_D6WQV5 | 1e-66 | 40.61% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQV5_TRICA
NCBI RefSeq: XP_001662371.1 | 1e-72 | 40.00% | fk506 binding protein [Aedes aegypti]
NCBI nr blastp: gi|157131940 | 3e-71 | 40.00% | fk506 binding protein [Aedes aegypti]
NCBI nr blastx: gi|157131940 | 1e-71 | 40.00% | fk506 binding protein [Aedes aegypti]
Group
Gene Ontology:
GO:0005488 | 5.5e-19 | binding
GO:0006457 | 4.2e-09 | protein folding
KEGG pathway 
InterPro domain:
[27-358] IPR023566 | 2.9e-61 | Peptidyl-prolyl cis-trans isomerase, FKBP-type
[207-345] IPR011990 | 5.5e-19 | Tetratricopeptide-like helical
[74-150] IPR001179 | 4.2e-09 | Peptidyl-prolyl cis-trans isomerase, FKBP-type, domain
[217-332] IPR005158 | 2.9e-06 | Bacterial transcriptional activator domain
Orthology group: MCL13011 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211794-TA
ATGAGCGAAGGAGAGCGTACTACGATTGATAAATCTGAGAGCAGTTCTTTCGAGGACTTAGCCTCTGCAGCTGCGCATGAAATAGAGGAGGCGAAGCTTGCTGATGCAGCGTCGGCGAAAATAGACAAGAAAGAAGAGAAACCTCAGGAAGAGTGGCAGGATGTTCTTGGCTCAGGCGCTTTGCTAAAAAAAATCATAGAACAGGGTGATGAAGCAAGTGGAGAAAGGCCACAAAGAAGTGATATATGTAGGATTAGCTATGAACTTAGACTACAGGATGATCCAAAAAATATTATTGAGAAACGAGATAACTTGAAAATTTATTTGGGTGACAATGAAATCTTACAAGGTTTGGATTTAGCACTTACATTAATGTACCGAGGTGAGGCATGCTTATTACGAATAGCTCCCAGATTCGCTTATGGAGATTCAGGTCTTAAACCTGGCGAGAGTCTTGGTTTGGTCGGTGAAGTGGATAGCCCTAAATATGACGGGGAGGCCATAGGACCGGAAACATGGCTAGAGGCCTCATTAAAACTCCATGATTGGACGGAAGAATCGGAACATGAAACATTGCCTATCGCAGAAAGAATGGAAATAGGAATTCGTCGTCGTTGTCGTGGTAACTGGTGGTATGGCCGAGGGGAGTCCCAGTTGGCTGTTCAGTTGTACCGACGAGCTTTGGATGTACTTGATGAAAGTGAAGGGGGTATTAGTGACCCCACGGCCTCAGGAGATTTGGAACCAGCTTCAGAAGCCCTACACGCCCTGCTGGAGGAACGACTCAGGGTTCATAATAATATGGCAGCTGCTCAGTTGAAAGCCGGTGCCTATGAGGCTGCTTTACAGGCTGTCTCAAGAGTTTTGACCTGTCAGCCTCAAAACGCAAAGGCTTTGTACCGTAAATCACGAATCCTAACCGCAATGGGACGCAATTCTGAAGCCCTAGAAGCAGCTAGGGCTGCTGCAGCTTTGGCCCCCAGTGATATTGGAGTTCGTAAAGAATTGTCGAAATGTGAACAGAAAGCTACAAGAGATAGATCTGTTGAAAAAAAGCTTGCTAAAAGAATGCTAGGAACTGCTGGACAGTCTAAACCAGAACCAGAGAAAAAACCCTCCAGAGCTAAGATGTTCATATGGAGTTCTCTGCTGCTGAGCTTCTTGGTTGGTGTTGCCAGTGTGTTAGTGTACCGCTACAAGATGCAATCAGAATGA

Protein sequence:

>DPOGS211794-PA
MSEGERTTIDKSESSSFEDLASAAAHEIEEAKLADAASAKIDKKEEKPQEEWQDVLGSGALLKKIIEQGDEASGERPQRSDICRISYELRLQDDPKNIIEKRDNLKIYLGDNEILQGLDLALTLMYRGEACLLRIAPRFAYGDSGLKPGESLGLVGEVDSPKYDGEAIGPETWLEASLKLHDWTEESEHETLPIAERMEIGIRRRCRGNWWYGRGESQLAVQLYRRALDVLDESEGGISDPTASGDLEPASEALHALLEERLRVHNNMAAAQLKAGAYEAALQAVSRVLTCQPQNAKALYRKSRILTAMGRNSEALEAARAAAALAPSDIGVRKELSKCEQKATRDRSVEKKLAKRMLGTAGQSKPEPEKKPSRAKMFIWSSLLLSFLVGVASVLVYRYKMQSE-
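
The transcript and protein records above are related by translation: 1215 nt is 405 codons, that is, 404 amino acids plus one stop codon, which this record writes as a trailing "-". The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1) and assuming the transcript is a complete coding sequence in frame from its first base. The function name `translate` and the `stop_symbol` parameter are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the record above); codons with ambiguous bases become "X".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS211794-TA, copied from the record above.
print(translate("ATGAGCGAAGGAGAGCGTACTACGATTGAT"))  # MSEGERTTID
print(translate("ATGCAATCAGAATGA"))                 # MQSE-
```

Passing the full DPOGS211794-TA sequence to `translate` and comparing the result with DPOGS211794-PA is a quick consistency check on the two records.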