Monarch geneset OGS2.0

DPOGS202054
Transcript: DPOGS202054-TA, 1119 bp
Protein: DPOGS202054-PA, 372 aa
Genomic position: DPSCF300053 + 836983-838101
RNAseq coverage: 82x (Rank: top 64%)
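The lengths and coordinates listed above can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive (the record does not state this) and that the transcript length includes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start, end = 836983, 838101    # assumed 1-based, inclusive, on DPSCF300053
transcript_bp = 1119           # length of DPOGS202054-TA
protein_aa = 372               # length of DPOGS202054-PA

span_bp = end - start + 1      # inclusive span length on the scaffold
codons = transcript_bp // 3    # codon count, including the stop codon

# The genomic span equals the transcript length, which is what one would
# expect for a gene model without introns.
print(span_bp == transcript_bp)    # True
# The transcript is a whole number of codons.
print(transcript_bp % 3 == 0)      # True
# Dropping the stop codon gives the listed protein length.
print(codons - 1 == protein_aa)    # True
```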
Annotation
Heliconius: HMEL016774, 7e-115, 54.42%
Bombyx: BGIBMGA012570-TA, 4e-88, 44.83%
Drosophila: wuho-PA, 2e-31, 26.84%
EBI UniRef50: UniRef50_D6WD96, 2e-48, 33.83%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WD96_TRICA
NCBI RefSeq: XP_001604066.1, 2e-40, 34.21%, PREDICTED: similar to Im:6900357 protein [Nasonia vitripennis]
NCBI nr blastp: gi|270003042, 8e-48, 33.83%, hypothetical protein TcasGA2_TC000064 [Tribolium castaneum]
NCBI nr blastx: gi|350419221, 4e-51, 35.41%, PREDICTED: tRNA (guanine-N(7)-)-methyltransferase subunit WDR4-like [Bombus impatiens]
Group
Gene Ontology: GO:0005515, 3.4e-22, protein binding
KEGG pathway:
InterPro domain: [29-326] IPR011046, 3.4e-22, WD40 repeat-like-containing domain
                 [39-283] IPR015943, 1e-18, WD40/YVTN repeat-like-containing domain
Orthology group: MCL14131, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202054-TA
ATGTCCTGTCTTGCTGTGAGTGACGATATTGTCGCAGTTTCAAATGCATTCTATATAGACTTTTTTTATAAATCAGATAATGTTATTAATAAATTAACAATCGATCCTTCACCTTCAGAGGATGAAACAGTAACGGACCTTATAATATCATATGATAGTAAGTATTTAGCCGTACTCTTGTCTCTTTCTAAGAGGGTAATAATTTACGAGCTACAACATATGCAACAAATTTTAAACATTACACTACCACGAAGAGCTAGTAAGATAAGATTTAACATAAGCAATACACAAATATTTGTAGCTGATAAGACTGGTGATGTTTTATCTTATGATATACCATTAGGGAACAATGGTATCAAAATCCTGGGACATTTAAGTCTTCTTCTAAATGTCTTACAAACTGATGACTCAAAGTATCTGATTTCATGTGACAGAGATGAAAAGATCAAAGTGTCTTGCTATCCAAACACATATAATATACAAACATATTGCTTGGGTCACAAGGAGTTTGTTAATCACATAGAACTCTTGCCACACCTAACAGAATATTTGACAAGTAGTTCCGGAGATGGAACTGTGAAAATTTGGAATTATGTAAATGGTATCTTAATGCACACTATTGATACCGGTAATGACTTAGAGAATACTGAGTTACAAGAAAAATTTTGTAAAACCATGGATGAAGATGGAATAGAAGTTTCATCATTACCCATAGTACATTATTCAATATCAAAATATGATGAGAACTCAAGCATATTAGCAATTGCAGTACATACATATATGAAAATTATGATATACAGATTACAATCACAAAATAACAAATTCAGCCACAATTTAATATCAGAACTGTCAGTCGATCAATTTCCTTTTGCTATTAAAGTTAGCAACTTATCCTTATATGTATACGACAATGCTGATTTTAAAATTAAGGAGTACCATATCCTTCATAAGAATGAGAATATAATAATCGAATTTGCAAGGGATATTCAAGTATTTAAAGAACAAGATTGTAAGTCTGAAAAATATGATCCAGATGCAATCAAGGTTTTATTTAAAAGAAAATTTGATAATGTACAGGAATACCAGGAAAGGAAGAAACTAAGATTAGAGAAATCTTAA

Protein sequence:

>DPOGS202054-PA
MSCLAVSDDIVAVSNAFYIDFFYKSDNVINKLTIDPSPSEDETVTDLIISYDSKYLAVLLSLSKRVIIYELQHMQQILNITLPRRASKIRFNISNTQIFVADKTGDVLSYDIPLGNNGIKILGHLSLLLNVLQTDDSKYLISCDRDEKIKVSCYPNTYNIQTYCLGHKEFVNHIELLPHLTEYLTSSSGDGTVKIWNYVNGILMHTIDTGNDLENTELQEKFCKTMDEDGIEVSSLPIVHYSISKYDENSSILAIAVHTYMKIMIYRLQSQNNKFSHNLISELSVDQFPFAIKVSNLSLYVYDNADFKIKEYHILHKNENIIIEFARDIQVFKEQDCKSEKYDPDAIKVLFKRKFDNVQEYQERKKLRLEKS-
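
The protein sequence ends with "-" where the transcript ends with the stop codon TAA. The sketch below shows how the two sequences relate, assuming the standard genetic code and a reading frame that starts at the first nucleotide. The `translate` function and its `stop_symbol` parameter are hypothetical helpers written for this illustration, not part of any database tool; only the first 18 and last 12 nucleotides of DPOGS202054-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order so they line up with the amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt, stop_symbol="-"):
    """Translate a coding sequence in frame 0, codon by codon.

    Stop codons are written as stop_symbol, mirroring the trailing '-'
    in the protein record (an assumption about the record's convention).
    """
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


head = "ATGTCCTGTCTTGCTGTG"   # first 18 nt of DPOGS202054-TA
tail = "GAGAAATCTTAA"         # last 12 nt of DPOGS202054-TA

print(translate(head))  # MSCLAV
print(translate(tail))  # EKS-
```

Both outputs match the start (MSCLAV) and end (EKS-) of DPOGS202054-PA. Passing the full transcript to the same function should reproduce the whole protein record under the stated assumptions.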