Monarch geneset OGS2.0

DPOGS212620
Transcript: DPOGS212620-TA (1335 bp)
Protein: DPOGS212620-PA (444 aa)
Genomic position: DPSCF300245 + 180239-181889
RNAseq coverage: 72x (Rank: top 66%)
Annotation
Heliconius: HMEL002480; 3e-166; 80.34%
Bombyx: BGIBMGA005214-TA; 0.0; 81.03%
Drosophila: CG17265-PA; 2e-60; 56.31%
EBI UniRef50: UniRef50_E2ALE1; 3e-75; 59.93%; Coiled-coil domain-containing protein 85C n=1 Tax=Camponotus floridanus RepID=E2ALE1_CAMFO
NCBI RefSeq: XP_001605643.1; 9e-77; 48.70%; PREDICTED: similar to GM03282p [Nasonia vitripennis]
NCBI nr blastp: gi|322788695; 5e-78; 59.34%; hypothetical protein SINV_04799 [Solenopsis invicta]
NCBI nr blastx: gi|383865896; 2e-82; 47.07%; PREDICTED: coiled-coil domain-containing protein 85C-like [Megachile rotundata]
Group:
KEGG pathway:
InterPro domain: [210-326]; IPR019359; 6.4e-47; Protein of unknown function DUF2216, coiled-coil
Orthology group: MCL15791; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212620-TA
ATGTGGGCGCAGGTAAGAGAATACGATAGAGCGAGATGCCGCGAGGCTCTACAGACTGCGCAGGCGCAGGATGCAAGCGGTATAACATCCATTGAACCGATGACTGTTCCGGCAACGGTGTCAACGATGTCAATCAGACAAACCGGTGAATTTATAAAACACCAACAACTAGGCAAACAGGGCTCCGAACCGGTCAACTTTCCACCGCGGTATCATCCGCCACCTGCCGCTGCGGGCTCCAAATTACCCACTACGGACGCGGCCGCTAAGGAATTCAAACCTAATACGACTAAAACTATAGATCCCAATAAGTTTCAATATCCAGTGGGAGTGCCGAATACGACTCCGGGTTTATACCCCTCCGGCGCCTATCCTCACCTGAAGTATTTTCACGGTCCCGCGGGATATCCATTACCTCCGGCAGCTTTACGGATGATACGACCAAGTGGTGAGATCGTCACGAAAGCTATAGTACACGAACCTGTCGAAGCTACAGCGAGGGCTAGGAGTGCTGACGATACACAACGTGCTCATCGCAATTTAGAAGAGGAACAGATACACAGACGATCCGCTGACATGCTCAAGTTCACGAAACGTATCGAATCTGATGCTGCTCGTCAATCGACTGATCAAAGGCGGCAAATTCAAACACTCTTAGACGAAATAAAAGCATTGAAAGAGGCTAATCGGCGATTAAGCGAAGATAATCAAGAACTCCGTGACTTGTGTTGCTTTTTAGATGACGATCGTCAAAAAGGTCGTAAACTAGCGCGCGAGTGGCAACGGTTTGGAAGATACACGGCTTCCGTGATGAGACAAGAGGTGTCGGCTTACCAAAACAAACTGCGTGAGTTGGATGACAAACAACAAGAACTCATACGAGATAATCTAGAATTAAAAGAGTTATGCCTGTACTTAGACGAAGAACGCGAGCGTATTTCTTGCGTGAATTGTGGAAGGACAGCGACCAGAGAGAGGGACGACGGGGATGGAAGCAGCAGTGGAACAAACGCTGAAGAAGTATCACGACCACAGGCTGCTTCTTTATCAATAACGCATCCCCAGCTTGCTGAACGTACAGTTCAATACGTGAGAGACTTGGAGCAAAAAGTAAGACGATTAGAAGCAGAGAAAGGTATGGGGAATGGACTTAACGAGAGGCCAGAAGCGGTGGTACGCGCCCTGCAAGTGTTGGAAGTAAGAGAAAGAGTCGAGAGAGAGAGAAGAAGACCAGCTCCTGACTTAGATTGTGGCGAACAAGCGTTAGTGAGGGAAATGTGTAACGTTGTTTGGGGAAAATTAGAGGATGCGCCACCGCAGGCGCCGCCGCGTTGA

Protein sequence:

>DPOGS212620-PA
MWAQVREYDRARCREALQTAQAQDASGITSIEPMTVPATVSTMSIRQTGEFIKHQQLGKQGSEPVNFPPRYHPPPAAAGSKLPTTDAAAKEFKPNTTKTIDPNKFQYPVGVPNTTPGLYPSGAYPHLKYFHGPAGYPLPPAALRMIRPSGEIVTKAIVHEPVEATARARSADDTQRAHRNLEEEQIHRRSADMLKFTKRIESDAARQSTDQRRQIQTLLDEIKALKEANRRLSEDNQELRDLCCFLDDDRQKGRKLAREWQRFGRYTASVMRQEVSAYQNKLRELDDKQQELIRDNLELKELCLYLDEERERISCVNCGRTATRERDDGDGSSSGTNAEEVSRPQAASLSITHPQLAERTVQYVRDLEQKVRRLEAEKGMGNGLNERPEAVVRALQVLEVRERVERERRRPAPDLDCGEQALVREMCNVVWGKLEDAPPQAPPR-