Monarch geneset OGS2.0

DPOGS202303
TranscriptDPOGS202303-TA1236 bp
ProteinDPOGS202303-PA411 aa
Genomic positionDPSCF300032 + 242089-244050
RNAseq coverage314x (Rank: top 36%)
Annotation
HeliconiusHMEL0047330.078.83% 
BombyxBGIBMGA004975-TA7e-7472.00% 
DrosophilamRpL37-PB8e-8738.29% 
EBI UniRef50UniRef50_B3DNM71e-8438.29%RH29922p n=5 Tax=Sophophora RepID=B3DNM7_DROME
NCBI RefSeqXP_001662478.14e-8740.81%mitochondrial ribosomal protein, L37, putative [Aedes aegypti]
NCBI nr blastpgi|1571321317e-8640.81%mitochondrial ribosomal protein, L37, putative [Aedes aegypti]
NCBI nr blastxgi|1892391563e-8941.89%PREDICTED: similar to LD25118p [Tribolium castaneum]
Group
Gene OntologyGO:00058401.9e-08ribosome
GO:00064121.9e-08translation
GO:00057391.9e-08mitochondrion
GO:00037351.9e-08structural constituent of ribosome
KEGG pathway 
InterPro domain[48-384] IPR0107931.9e-08Ribosomal protein L37/S30
Orthology groupMCL12917 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202303-TA
ATGAAATTAACAAAAGTACTATGTAGGCAGCATATAGATTTTATGTTCAAAAGGCATTGGATCATTCAAGGTAAACGTGTGCCGATAAATTTGGGTATTGAACAATATCTGAAGGAAAAAGGGATCCCAGTGCAAGATGCCCTAGAGTTTGTAAAAGAAGAACCGCCGCAACGAGAAAAGGTAAAAATTATTGGGCCCTATGAAATGCCATTACCTCTAGACGAAAATCATCCTAATTATAAAGAGAGACCATGTCTAACCTTAAAACACACAAATGTACTCTTAGAAGGTATTTCACAAGCACAAGTTCTCACTAAAAGCATAATATCGGAAGACAAACTACCTCATAGAATCGAAGAACTCGCAGAATTGCCAGCACCAAAATCGGTACACGAAGGCGTTAAGAAAGCAATACTTAATGCCAACATTTTTGATTGTGAACAGAAAAAACTGCCAAAAATCAAAGATCCTGAAAGGCCAGCCTATAACTTTCCACGCATTTTAGGAATAACCGACAAAAGGAAAAATGAAATCCTAACAAATAAATTGCTGCATATAATCGAAAAGAGCAATGATTCGGAGGTCACTCTCAGCAAATATGTTGTGAATGATATTGAAGCCAGCTGTGTATTTGACAAGGACGGTGATCTTATTCAGTTCCAAGATGTTTCTAATATCTTAGTAACCAGCAACAAACCTCTCAAGCATGAGCTCAATGAAGCTGATGCCTCATACATTGATATACCGGACCTGTACCCCGTTAAACACACAGTTACACTACCACCTGAACATTTTTATAACGAAAGCAGTTTTTACCCGATCCAACGTAGTGTGCCTATGAAGCACCCGCACACAACCTGGCTTCATTTCAACAAGACAGAAATCTCCAACATATTTGAGACTCCGGTTACTCCGTCCCAAATCCTCGGCCGCTCTCTGACACACGCATTCACAATAGCTTCCTCATACGCAAAGCAGTTGTATGGGGAGGATGTCAAAGACCTACCTGATCCGGTACACATAAATTGCATCCAGACAGACGGTCAGAGATTTCATTTCGGTGTCTTTGAACTGAACACATTAAACGTCGACGGCACGGATGGTACTAAGAACGTTTGGTATAGCAAGAACAACATGAAGTTATACGACTCCAGCAGATATCTGGACGGTGCGCCCGTGCTAGAGAATTACAACCCTAAAGTTTACGGATATATAAATGCCTTTTACAACTGCTAA

Protein sequence:

>DPOGS202303-PA
MKLTKVLCRQHIDFMFKRHWIIQGKRVPINLGIEQYLKEKGIPVQDALEFVKEEPPQREKVKIIGPYEMPLPLDENHPNYKERPCLTLKHTNVLLEGISQAQVLTKSIISEDKLPHRIEELAELPAPKSVHEGVKKAILNANIFDCEQKKLPKIKDPERPAYNFPRILGITDKRKNEILTNKLLHIIEKSNDSEVTLSKYVVNDIEASCVFDKDGDLIQFQDVSNILVTSNKPLKHELNEADASYIDIPDLYPVKHTVTLPPEHFYNESSFYPIQRSVPMKHPHTTWLHFNKTEISNIFETPVTPSQILGRSLTHAFTIASSYAKQLYGEDVKDLPDPVHINCIQTDGQRFHFGVFELNTLNVDGTDGTKNVWYSKNNMKLYDSSRYLDGAPVLENYNPKVYGYINAFYNC-