Monarch geneset OGS2.0

DPOGS202374
TranscriptDPOGS202374-TA1161 bp
ProteinDPOGS202374-PA386 aa
Genomic positionDPSCF300104 + 182327-186105
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0028922e-14662.88% 
BombyxBGIBMGA013897-TA2e-13859.62% 
DrosophilaCG12822-PA9e-7238.52% 
EBI UniRef50UniRef50_A1Z7591e-6938.52%CG12822, isoform A n=14 Tax=Drosophila RepID=A1Z759_DROME
NCBI RefSeqXP_002063544.14e-7238.08%GK21968 [Drosophila willistoni]
NCBI nr blastpgi|1954310167e-7138.08%GK21968 [Drosophila willistoni]
NCBI nr blastxgi|1954310161e-6838.08%GK21968 [Drosophila willistoni]
Group
KEGG pathway 
InterPro domain[72-375] IPR0233701.6e-61Uncharacterised domain UPF0066, YaeB-like domain
[90-211] IPR0013784.7e-44Uncharacterised domain UPF0066
Orthology groupMCL15247 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202374-TA
ATGACAGAAAATATCGAGTTTTATCAGAATCAGATAGCACTGGCTAGAACGGAAATAAAAAATCTGAGGCAACGCATTTCAGCTCTTAAACATGAACATCAAAAGGAAATAAGCCATATAAAATCAACGCTCAGCAGTCTTCGTTGCTCTAAATGTGCTGAAGAAGCTACGACCGTAGTGAACAATGACACTCGTGATGGAGGTACATCAACGGACTCACAGATCAAGTACAAACCTATTGGATATATTGAGACATCATTTAACAACAAACGAGGGGTGCCACGGCAGACTTCGGTGATGACAAATTCTGTTGGTGTTATAACAATCGACACAAATGTTTTTACCAACCCTGAACACGCCCTCAGCGGCTTGGAAGAATTTTCTCATATATGGATAATATATCATTTTCATATGACGGAGAGTAATAGTACTCCGGCTAAGGTATCACCACCTCGTTTGGTTGGCGAGAAGAAAGGTGTGTTTTCAACGAGGTCCCCACACAGACCGTGTCCTATAGGACTGTCGCTTGTTAAAATTCATAGCATACAAGGCAATAAGATACATTTCTACGGTGTGGACATGGTGAATGGGACGCCAGTGTTAGACATTAAGCCATACATACCCCAATATGACTACCCAATATCGGAAGCACAGGAGCGACCGCCCACTGAAGGAATCGATTTGGCCGGTCTGAATCTTAATGTATCAAGCTTAAACGTAACAACAGAATATGGTGATGAACTGCCCGGGGATTTATTGACACCGTTGACACCGTCGGAGAATTTGGATGATGTTTTGTCCATACAACGAGGAGAACCGGACGGTCAGGAACGGTACACAGCGCAGGCTTCAAACTTAAACCAAGCTGATGTAAGAGTGGCCTCGTGGATAACGAACACGAGGAACAGATATCAAGTGGCTTTCACCGATGAAGCTCTCATGAGGATAGAAAATCTCATCGGAAGCAGAGCTGATAGTTTTAAAGCTAATATAGAGAGTTTACTGTCAGAAGATCCGAGATCCGTGTACGTGAAAACTAAATACAAAGACCACGAGTACAATTGTGTGTTAGAGGATCTGTCTATAACCTGTGTGTTTGATGAAACGACCTCTGTGTGTACCATCATAGCTGTGAAGAGTGCCGACGAGTTACAGAATTGA

Protein sequence:

>DPOGS202374-PA
MTENIEFYQNQIALARTEIKNLRQRISALKHEHQKEISHIKSTLSSLRCSKCAEEATTVVNNDTRDGGTSTDSQIKYKPIGYIETSFNNKRGVPRQTSVMTNSVGVITIDTNVFTNPEHALSGLEEFSHIWIIYHFHMTESNSTPAKVSPPRLVGEKKGVFSTRSPHRPCPIGLSLVKIHSIQGNKIHFYGVDMVNGTPVLDIKPYIPQYDYPISEAQERPPTEGIDLAGLNLNVSSLNVTTEYGDELPGDLLTPLTPSENLDDVLSIQRGEPDGQERYTAQASNLNQADVRVASWITNTRNRYQVAFTDEALMRIENLIGSRADSFKANIESLLSEDPRSVYVKTKYKDHEYNCVLEDLSITCVFDETTSVCTIIAVKSADELQN-