Monarch geneset OGS2.0

DPOGS210261
Transcript        DPOGS210261-TA  1410 bp
Protein           DPOGS210261-PA  469 aa
Genomic position  DPSCF300216 - 257920-259341
RNAseq coverage   274x (Rank: top 39%)
Annotation
Heliconius      HMEL003522        0.0     76.28%
Bombyx          BGIBMGA002304-TA  3e-168  71.32%
Drosophila      CG10055-PA        2e-25   25.89%
EBI UniRef50    UniRef50_E2C289   9e-86   43.58%  Protein SHQ1-like protein n=8 Tax=Formicidae RepID=E2C289_HARSA
NCBI RefSeq     XP_623827.2       8e-89   44.60%  PREDICTED: similar to SHQ1 homolog, partial [Apis mellifera]
NCBI nr blastp  gi|350414575      3e-95   44.52%  PREDICTED: protein SHQ1 homolog [Bombus impatiens]
NCBI nr blastx  gi|350414575      2e-101  42.58%  PREDICTED: protein SHQ1 homolog [Bombus impatiens]
Group
KEGG pathway 
InterPro domain  [207-387]  IPR007009  2.9e-58  SHQ1 protein
Orthology group  MCL15579   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210261-TA
ATGCTTACTCCGAGATTTAAGTTATCTCAAGACGAAAATCATGTGTTTATAACCGTACATGCACCCTATACAAATATCGGCGAGTCGGAGATCGATATTGATGGCGAGAATTTTCTATTTGTGTCTAGTCCATACTTCCTTAGACTGCGACTACCTGGAAAAATTGTTGAAAACGATCAGTCCAAAGGCTCTTATATCTGTGATTCTGGAGATTTTAATTTCACTTTCGACAAGGAAACACCTGGGGAACACTTCGAAAACCTCGATATGATTACCAGCTTGCTGGCTCCCCGGGATATACCCGATGTGAATCCAAATTTAGTTGAAATGCTAGAAGAAGATGGCGTCAATATATCCGACAATGAAGATGATGATAAGGAGGAATGCAATAAATACTCATATGGATTTGCAAACAAAATATCAACAGACTTCAAAGATGTAGGCGAGGAATTTCCGCAGATATTCGAACTTCGCGTACCAGAAAGAGTACCTATTGCAGAACGCAATGAATTGCGCGAAAAATACGAAAACCACAAATTTTCGTCGGATCACTATTTGGCAGATTTGTATGAAGAAGAACTTCTTGCTCCACACCTAGCCGTTGTCACCGAGTGGTGTGCACCCGATTTTAATAAAGAAATCGATTTCACAGATGAAGAAGTCAGCATATTAAAGGAACTGCCAAATAAGCATTACTTATTGAGCAAAAAAGAGCAAAAGCAAGTGTTTATGGGGCTGGTTGATATTTTATATGGTTATTGTTATGATAAACGCACAACTCAGAATGAAAGCAATGTGGAATCTAGTTGGACAATCAATAAATTGTCACCAACTTTGAGTTGCTTTTGCACATTTAATGATACTAAAGAAGTGCTTATAGCCTGCTACCGGCGTGCTCTCGTATTCCCGATATTCCGTAACTTTGAACTGTGCACGAAAGTGCATAAGGACTTGGTAGCATTATTGAAGATAGGAAAGAAATATGTCATCAAATGTCTGATCAGTGTTTTTACAATGTTTAATCTGAATACTGAAGCAAGGTACATATTGAACCAGTTGTACATCAAGGATTATTTAATATTTTTACAAAAATGTCGTGCCGAGGAGTTTGACCAGCTCTCCGATGAGATAGATAATATTGAAATATTGAAAAAGGATTTAGGACTTGAATTAGAGGAGTTAGAAGCAGCCGCAGAAATGGTTAAACTTGAAGAGACACAGATAATGGAAAATGAAATGGCTGTAAAGATGGCTCATATGTCACTGTTACCTGGACTTAAGAAGGCTACATACATGAGCAGTGATGAAACAGATGATTCCACCGATTCAGACGATTCTTCCACCGATTCGAGCAGTGATTCCTCTTCCAGTGATTCATCGGAATGGGATTCCGACGATGAGAGATCCTGA

Protein sequence:

>DPOGS210261-PA
MLTPRFKLSQDENHVFITVHAPYTNIGESEIDIDGENFLFVSSPYFLRLRLPGKIVENDQSKGSYICDSGDFNFTFDKETPGEHFENLDMITSLLAPRDIPDVNPNLVEMLEEDGVNISDNEDDDKEECNKYSYGFANKISTDFKDVGEEFPQIFELRVPERVPIAERNELREKYENHKFSSDHYLADLYEEELLAPHLAVVTEWCAPDFNKEIDFTDEEVSILKELPNKHYLLSKKEQKQVFMGLVDILYGYCYDKRTTQNESNVESSWTINKLSPTLSCFCTFNDTKEVLIACYRRALVFPIFRNFELCTKVHKDLVALLKIGKKYVIKCLISVFTMFNLNTEARYILNQLYIKDYLIFLQKCRAEEFDQLSDEIDNIEILKKDLGLELEELEAAAEMVKLEETQIMENEMAVKMAHMSLLPGLKKATYMSSDETDDSTDSDDSSTDSSSDSSSSDSSEWDSDDERS-