Monarch geneset OGS2.0

DPOGS205921
TranscriptDPOGS205921-TA1302 bp
ProteinDPOGS205921-PA433 aa
Genomic positionDPSCF300975 - 346-4183
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0149843e-9779.07% 
BombyxBGIBMGA000114-TA1e-2858.39% 
DrosophilaCG5913-PA7e-2068.25% 
EBI UniRef50UniRef50_UPI0001791B4C4e-5937.76%UPI0001791B4C related cluster n=1 Tax=unknown RepID=UPI0001791B4C
NCBI RefSeqXP_001943023.17e-6037.76%PREDICTED: similar to FAM98A [Acyrthosiphon pisum]
NCBI nr blastpgi|3320163345e-4839.49%Protein FAM98A [Acromyrmex echinatior]
NCBI nr blastxgi|3838536241e-7839.36%PREDICTED: protein FAM98A-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[3-170] IPR0187971e-42Uncharacterised protein family FAM98
Orthology groupMCL12634 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205921-TA
ATGCACTTCAGTTACGAAGGGCCATTATCTAACGAAGAAGCCTTCGCCAAAGCTTTGGAAGTTGGACCCAAGTCTCTTGAATATACAAAGTTAGTGCACATACTGGCCGAAGAGTTAAAGATGTTGTGCAGTTTAGAAGAAAATGTCAGCATTATGAATGATTCCGACGAGTCCAGTTCGTTTCTTCTAGAACTGAGCTCATTTCTGAAAGAGCTTGGATGTCCATATAAAAAATTGGTGACAGGCCATATGTCATCTCGTCTCCAGACCAAAGAGGACAGAATACTCCTTCTAGATTATCTGGTGTCGGAGTTAATGGCCGCTAGGATGGTCAGCGTTGATTGTGTAAAAGAAAATACGGGCATGCAGATTGTTATGCAAGAATCTCAAACAGCTAAAGATCTTAAGGACATATTGATAACTCTTAAATTCAACAAACCTCCCCCAAACATCTCACCGGACATGCTGTTTGCTAAACTGGAAGCTAAACTTAAGGATGCCATTGCCAAAGAAGGTATTTTATTAATAATGCCTAAGAGAGAACAGTTGAAATTGAAGCCAGCAGTGAATTTATCAGATTTCCTCGCAGCTAGGACGGATTTGCTGTACGTGGAAAAGACATCCAGCGCTAGTGTCAGGAAAAACACTATCAGTGATGTCAATAAGGTCTTAATAGGAAGGGTACCCGATAGAGGTGGTCGACCAAATGAGGCCCAACCTCCTCCTCCGGAGATGCCCTCATGGCAGCAGAGATCTACACAGGGGGGAGACCGAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGCGGAGTTCAAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGCGGAGTTCAAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGCGGAGTTCAAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGCGGAGTTCAAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGCGGAGTTCAAGGAGGTAGGGGAGGAGGTCAAGATAGAGGAGGGAGGGGTGGAGGTCAGGAGAGGGGAGGGAGAGGAGGTCAGGGGGGTTACAGGGGAAAGGGCGAAGGTCGAGGTGGTCGAGTCCAAGGTGGATATAACCAGAGCTCAGGGGATGTGAGACAATCAAACTATCAGAAGGGTTACGACCAGAACCCTGGCTATGATAACAGATACAATCAGCCGAATAATCAAGGTGAGATACATTTTTTCATTATTATTACAGAAATTATTAATATTTTATATATATATTTATATTAA

Protein sequence:

>DPOGS205921-PA
MHFSYEGPLSNEEAFAKALEVGPKSLEYTKLVHILAEELKMLCSLEENVSIMNDSDESSSFLLELSSFLKELGCPYKKLVTGHMSSRLQTKEDRILLLDYLVSELMAARMVSVDCVKENTGMQIVMQESQTAKDLKDILITLKFNKPPPNISPDMLFAKLEAKLKDAIAKEGILLIMPKREQLKLKPAVNLSDFLAARTDLLYVEKTSSASVRKNTISDVNKVLIGRVPDRGGRPNEAQPPPPEMPSWQQRSTQGGDRGGRGGGQDRGGRGGVQGGRGGGQDRGGRGGVQGGRGGGQDRGGRGGVQGGRGGGQDRGGRGGVQGGRGGGQDRGGRGGVQGGRGGGQDRGGRGGGQERGGRGGQGGYRGKGEGRGGRVQGGYNQSSGDVRQSNYQKGYDQNPGYDNRYNQPNNQGEIHFFIIITEIINILYIYLY-