Monarch geneset OGS2.0

DPOGS206623
Transcript        DPOGS206623-TA  1857 bp
Protein           DPOGS206623-PA  618 aa
Genomic position  DPSCF300048 - 792365-794844
RNAseq coverage   108x (Rank: top 60%)
Annotation
Heliconius      HMEL012312              0.0     65.01%
Bombyx          BGIBMGA001897-TA        4e-15   27.56%
Drosophila      CG9213-PA               6e-105  36.73%
EBI UniRef50    UniRef50_UPI0001791F67  2e-116  38.83%  UPI0001791F67 related cluster n=1 Tax=unknown RepID=UPI0001791F67
NCBI RefSeq     XP_001944739.1          3e-117  38.83%  PREDICTED: similar to MGC115403 protein [Acyrthosiphon pisum]
NCBI nr blastp  gi|193618085            6e-116  38.83%  PREDICTED: CWF19-like protein 2-like [Acyrthosiphon pisum]
NCBI nr blastx  gi|193618085            3e-126  38.59%  PREDICTED: CWF19-like protein 2-like [Acyrthosiphon pisum]
Group
KEGG pathway 
InterPro domain  [388-510]  IPR006768  1.4e-36  Cwf19-like, C-terminal domain-1
                 [519-611]  IPR006767  2.1e-18  Cwf19-like protein, C-terminal domain-2
Orthology group  MCL14632  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206623-TA
ATGAAAGAAAAGCGTTCTAAAAAACATAAGAAAAGCTCTAAGGAAAAACGATCTAGTGAATCAAAGAAAAATAAACAAAAAAGACATTCTTCAGATTCGTCATCAAATTCTGAATCAGATGAGTGGGTTGAGCAGGAAAGAGTTAAAAGTGAGTCTGCTGGTAGAGATGAATGGATGGCTATGACTGGAATGCTGAAAACATATACCAAGGATGACATAAAACCAAAACAAGATAAAACTGAGAAAATGCACATTGATTCATACAACCCCGCCACAAGCAGTAGAGAATTAAACCCATACTGGAAGGATGGAGGTTCCGGTATACCACAGTCATCAGAAAGCCTATCAAAAAGCAGAAAATTTATGAAACCATCATATGATGATGATTATTATAACACATCTAGTTCATCTAGCAAAAATACAAGTGGTAGGCAATACAAGCAGAATAGAGATTATGAAAGATCTTCCAACTGGAGAAAAGAAACAATGGATAAAAGGGAAGAAGAAAGGAAAAAGTACAATTTACATGATGATAAAAAAGATAATTTAAATTTAGAAAGTGATCATAAAGATAAAAAGTTGCAAAAAGATAATGATATACATGACAAATTAAAGACAAGCATTGAAACAAAAAAACCCGATTCTTCAACAAGAGAGAGAAAAGATTTGTATCTTTCCGATGAAAAGATGAATAAACTAGCAGCTAAAATTGTGAAAGCTGAAATAATGGGTGACTTGAAACAAGTAGAAGAACTTAAGTCTAAATTGGAGGCAGCAAGACAGTATAGGAAAAACAATCCCAATGTTGAAAACTTGGAAGACGATAGAGTTTTATTAATGTCTACTACCAGCACTGGAAATAGTAGACCTTTGACGAACAATCCTGTTGATACTAAAGGCAAGGGCAACAAACGAAAGGCCCAAACACATGATTCTGAAGGACGACTGAAATACTTTGGAGATGATGATAAATATAACTTAAAACAAATGTTTGAACAAGAAAAGTATGGTAATAATTACAATGAAGATGCAGAGTTGGTCAGAGCTGCTAACAAAAGTAAAAATCCAAGTGATGATTTAGCTGATATTTTCTTAGATAATATCACAAAGAATAAAAACCCAACAAGGAATAGTGAAATAGAAAAGCAACAGGCCATAAACCAGAATGTTAAGCTCGAAAGATCTTTAGAAGGTTGTGAATATTGTTTTGATTCCAAAAATATGTTGAAGCATCTAATAGTCAGTTGCGGAAACAAGATATACATGGCATTACCATCAAGGACGTCACTTGTCAAGGGCCATTGTATTTTAAGTACAATACAACATTCAAATTGTGTGACCAATGTTGATGAAGATGTATGGGATGAAATATTATATTACAGAAAAATGATTACCCAATATTATAATTCTCAAGATCAGGACGTAGTATTCTATGAAACTGCAACAAGATTACATAGATTTCCACATTTAGTAATTAATTGTGTGCCGATGCCGCGAGACGTTGGAGACACAGCATCCATATATTTCAAAAAAGCACTTTTAGAATGTGAAGCAGAATGGTCTATGAATAAAAAAGTTGTGGAACTAAAGGGAAAGAATATACGAAGAGGAGTTCCTAAAGGATTGCCTTATTTTTGGGTTGACTTTGGCATGGATCCTGGTTTTGCCCATGTAATAGAAGACCAACAATTATTTCCCAAATCCTTTGCTGAGGAAATTATTGGTGGCATGTTAGATCTGGATCACAGCCTTTGGAAGAATCCCAAAAAGGAATATGGAGATATTCAAAGAAAGAAGGTTATAGAATTTGTAAACAAATGGAAACCTTTTGAACAAAATTTTAAAGATAACAGTTAA

Protein sequence:

>DPOGS206623-PA
MKEKRSKKHKKSSKEKRSSESKKNKQKRHSSDSSSNSESDEWVEQERVKSESAGRDEWMAMTGMLKTYTKDDIKPKQDKTEKMHIDSYNPATSSRELNPYWKDGGSGIPQSSESLSKSRKFMKPSYDDDYYNTSSSSSKNTSGRQYKQNRDYERSSNWRKETMDKREEERKKYNLHDDKKDNLNLESDHKDKKLQKDNDIHDKLKTSIETKKPDSSTRERKDLYLSDEKMNKLAAKIVKAEIMGDLKQVEELKSKLEAARQYRKNNPNVENLEDDRVLLMSTTSTGNSRPLTNNPVDTKGKGNKRKAQTHDSEGRLKYFGDDDKYNLKQMFEQEKYGNNYNEDAELVRAANKSKNPSDDLADIFLDNITKNKNPTRNSEIEKQQAINQNVKLERSLEGCEYCFDSKNMLKHLIVSCGNKIYMALPSRTSLVKGHCILSTIQHSNCVTNVDEDVWDEILYYRKMITQYYNSQDQDVVFYETATRLHRFPHLVINCVPMPRDVGDTASIYFKKALLECEAEWSMNKKVVELKGKNIRRGVPKGLPYFWVDFGMDPGFAHVIEDQQLFPKSFAEEIIGGMLDLDHSLWKNPKKEYGDIQRKKVIEFVNKWKPFEQNFKDNS-