Monarch geneset OGS2.0

DPOGS206856
TranscriptDPOGS206856-TA1857 bp
ProteinDPOGS206856-PA618 aa
Genomic positionDPSCF300001 - 2863531-2865387
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0061410.088.35% 
BombyxBGIBMGA012808-TA0.080.39% 
DrosophilaCG7289-PA5e-16647.15% 
EBI UniRef50UniRef50_Q9VQ607e-16447.15%CG7289 n=20 Tax=Diptera RepID=Q9VQ60_DROME
NCBI RefSeqXP_319149.20.050.41%AGAP010005-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583921660.050.41%AGAP010005-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|583921662e-17950.16%AGAP010005-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[11-612] IPR0187954.1e-195Protein of unknown function DUF2152
Orthology groupMCL12771 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206856-TA
ATGAGTAGAAGAGCCGGAGAATTTTTAAGGCGACTTAACAGGTTCGCCGATGGTCCATTGACGAGGAAAAAGTTAGTCCTTATCATCCTAGTCGTGATATTCCTGCTTCTGTATGTAGGGCCGACAATCATGAACTCTCTGTTCAACAGGGAAGGTTCTCTGACAAACAACCAAAATATTTGTCACCGGAACTTTTTAAACCCCTTTCAAGATGCTCTTAATGAATACGATGCATATTTACGATTGGAATCTTCCGCCACATTGTCGACTTTGAGCCACAACTATGTGCCTTATGTAGGGAATGGAGTTTTGGGATTAACATTGGAACATGATTCCTATATGAATATTAAATTTGGCAGAACTTTATCACTGCCAGTATACTACCATCCCCTTTACATAATAGATGATTTAGATATGAAAGAATCGACATTAGTGGATTACAAGAATGGTATTGTACACAGATTTCAATGCAGCATTACTGGGATACATGTTTCATATAAATATTATGCTCACAGAACTACACCATCATTATTTGTACAAGAAATATTAATAAATAACCCTCTAAATGCTCCTAAAACAATCCACTTCTCCACACCTAGACTCTCTGATTGGCCAACATCTGTAAAACAAGTAATCAAATTGCATCAAGGTGTAGATGCTAAAGAGTACGAAGTTGTCACAGGCATGATTGATTTGCCAGAAAGCGAGAATGTAATCGCTGTGTCTGTGGTATGTCGGAAGATGAACAGCATTTTACATATTGAAGCTAGGGATGGTGTGGATCTTTTAATATTAACAACTATACAATACAGTAAACCAATAAAAAAATCAGATTATGCACAACAAAAAGATATTGTTGAGAAAAAAGCTATTGCTGAAATGGAGAAAGTAATAGCAATGACTGGTGATAGAGTTGCTGTGAGAAAATTAAGAGATAGTCATTCTACAGTCTGGCAGGGTTTGTGGGAGACTGGCTTCTACATATCAGATTCTAAAGCTGAAGGCGTCATCAATGGTGATCGCATTAATGCTACTATATACGCTATTCTTTCTCAAGTAAGAAGCTATGAACACGAGGAACATATCAAACCATCCACCAAATCAGAGATATTAAAAACTCTAACCTATTCTGAAGGTTGTTATGAAGGTTATTCAACATTGGATGCATTTAATTTGTGGAAACCTTTAAATTCATTGTCTAATTTAGACAATGTAGCAAGTAGATGGCTGCTTACTCTAGAAAAACAAGGATGTCACAATCTTCTTAAGGCTGGAGCAAGTGGTGTAAACCAGGCCATGATACTTAGTTTTGGTAGCTTAAGGTTTAGCAATCAACACCTCGAGTACAATATCCATCCATCAAAATTACATAGAGACTTTTTGTTTCGTAGAATCAACTATGGAAACAGAACCCATGTCAACATAAGTGTTATTTTACAAGAAGATAACAAGGCTGCTATATTTGTGGCTTTGGATCGCTCCGATAAAACATATTATGCCTGCGATGCTGGCTGTCTTGATTCACCTGTTCAATTGGGTCCCTACAGAAAGTATTTCCCTGTAAAATTAACCGAACCATTGACAGCAATTCTGTACATAACAGCTGATAAACAGCATATGGAGGACTTACGTCATGCTATCCATGTTCACGAGGTAGTAGAGGCACCGGCCCATGAGCATCATGTCATAGCGCTACACAGACATGGCACTACTTTTGGCGGCCTAAACCCATTAGTTTGGGCATCTATCATAATTTTAATAGTTATTTTTCACTTATTCCTTTGTAGAATCATAATGAATGAGTTCTGTGATAGCGGCACAAATATATCTTACAAGAGGTTGTATAACAAGGCTTAA

Protein sequence:

>DPOGS206856-PA
MSRRAGEFLRRLNRFADGPLTRKKLVLIILVVIFLLLYVGPTIMNSLFNREGSLTNNQNICHRNFLNPFQDALNEYDAYLRLESSATLSTLSHNYVPYVGNGVLGLTLEHDSYMNIKFGRTLSLPVYYHPLYIIDDLDMKESTLVDYKNGIVHRFQCSITGIHVSYKYYAHRTTPSLFVQEILINNPLNAPKTIHFSTPRLSDWPTSVKQVIKLHQGVDAKEYEVVTGMIDLPESENVIAVSVVCRKMNSILHIEARDGVDLLILTTIQYSKPIKKSDYAQQKDIVEKKAIAEMEKVIAMTGDRVAVRKLRDSHSTVWQGLWETGFYISDSKAEGVINGDRINATIYAILSQVRSYEHEEHIKPSTKSEILKTLTYSEGCYEGYSTLDAFNLWKPLNSLSNLDNVASRWLLTLEKQGCHNLLKAGASGVNQAMILSFGSLRFSNQHLEYNIHPSKLHRDFLFRRINYGNRTHVNISVILQEDNKAAIFVALDRSDKTYYACDAGCLDSPVQLGPYRKYFPVKLTEPLTAILYITADKQHMEDLRHAIHVHEVVEAPAHEHHVIALHRHGTTFGGLNPLVWASIIILIVIFHLFLCRIIMNEFCDSGTNISYKRLYNKA-