Monarch geneset OGS2.0

DPOGS215350
TranscriptDPOGS215350-TA2613 bp
ProteinDPOGS215350-PA870 aa
Genomic positionDPSCF300120 + 500890-504777
RNAseq coverage326x (Rank: top 35%)
Annotation
Heliconius% 
BombyxBGIBMGA008115-TA1e-9535.64% 
Drosophila% 
EBI UniRef50UniRef50_D6WYA26e-11539.77%Putative uncharacterized protein n=21 Tax=Bilateria RepID=D6WYA2_TRICA
NCBI RefSeqXP_969432.22e-12242.17%PREDICTED: similar to Copia protein (Gag-int-pol protein) [Tribolium castaneum]
NCBI nr blastpgi|1892397534e-12142.17%PREDICTED: similar to Copia protein (Gag-int-pol protein) [Tribolium castaneum]
NCBI nr blastxgi|1892397532e-11736.38%PREDICTED: similar to Copia protein (Gag-int-pol protein) [Tribolium castaneum]
Group
KEGG pathwayuma:UM00214.12e-35 
 K00140 (E1.2.1.27, mmsA, iolA)maps-> Inositol phosphate metabolism
    Propanoate metabolism
    Valine, leucine and isoleucine degradation
InterPro domain[388-633] IPR0131037.4e-87Reverse transcriptase, RNA-dependent DNA polymerase
Orthology groupMCL10015 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215350-TA
ATGGCGGCAAACTACTTGGTAAATGTACCCAAATTACGCGGGCGGGAAAATTACAGCGAGTGGAGCTTCGCGGCAGAGAATTTCCTAATCCTTGAAGGCATGAAACATTGTGTAAAGCCAGAAGGAGCTGTAGTAGGAGCTGCAGACGACGAGAAAACTCGAGCAAAATTGATTATGACAATCGACCCGTCTTTGTTTGTACATGTGAAAAGTGTAAGAACAACGAAAGAACTCTGGGATAAACTACAACAGTTATTCGATGACAACGACATGGAGTCAACCATAGAGGAACGTGATGAAGTTGGCGGCGCGTTCGCAGCTCGTTCGAATTCTAAGTACAAGAAAAATAAAATGGCGTCAAGAAAAAATGTCAATGTTGGTAGCACTGCCGATACGTCAAAGTCAAACGTGACATGTTACAGATGTAAACAAAAAGGCCATTATAGAAATCAATGTACTAATAACGAAAATAACGCGTCGAACTTCAAGGAAAAACCCCGAATGCAGTCTAATGCGTTTAGTGCTGTGTTCCTGAGCGGGAATTTCAGTAAAAATGCATGGTATATTGACTCTGGAGCCAGTGTACACCTTACGGCAAATGAAAGTTTGGTTATGAATGCGTCGTATGATCAGAAACAGGAAATTATCGTTGCGAACAGTGAAAAGTTGTCAGTTTTGTGTTCTGGCGATGTGAAAATTATAACTACAACTGGTGATATTGATTACGAAATTATGGTTGAAGACGTTTATTGTGTTCCAAGTCTGGCGACTAACTTGCTATCAGTCAGCCAACTCATAAGCAAAGGAAACAAGATGGAAGTTTCCTCTCAGAGTTTTGATTCAGTGGGGGAAGAAGAAGTTAATATAGATGAACCAAACTCGGAATCTGACCACACAGATAGCTCAGAGGACACATTTCTGGATGTGGTTGATGAGACATATAAACCTAGTGATTCTGAAGCTGAAGATATCCCGCAGATTAGACCTCAAAGACCTACACGAGAGAGGAAGCAACCAGACAGGTTCAAATGTTCAAATTTTTGTGCTGGTGAGAGCACATATGATGATGTGACAGGATTGTCTCTTCAGGATGCCTTAGCCGGACCTGAAAAAGAACAGTGGAAAATAGCTATGGCTGAAGAGTTACAAAGTTTCAAAGAGAATGATGCATGGGAGATTAGCAATCCTCCTCAAGATGTTAGAGTTGTAAAGTGCAAGTGGGTGTTACGTAAAAAATATGATTGTGATAATAACATTCGGTTTCGTGCGCGTTTAGTGGCGAAAGGTTTTTCACAAGTTCAAGGTGTGGACTATACTGACACTTTCTCACCTGTAGTGAGGCATACCACATTGCGGCTTTTATTTGCTCTGTCTGTTCAACTTAATCTTGATATAACACATCTTGATGTGACAACTGCTTTCTTGTATGGAATTCTTGAAGAAGACATTTATATACAAATACCTGAAGGTTTTTCTGAGAAAGTAGAGAAAGGTCAAGTTCTTAAATTAAAGAAATCTATGTATGGTTTAAAACAGTCTTCAAGAGTATGGTACAAGAGAGTAGAGGAATGTCTTTTAAAAATTGGCTTTGTTAAATCTAAGATAGAACCTTGTATGTTTTTGAAAACACAGGATAAGTTAAAAACTATTGTTACTCTGTATGTCGACGATTTCTTCATTTTTTCAAATGATATTATAGCTACTAAGCACTTAAAAGATGTTTTATCTGACAATTTTAAAATTAAAGATTTAGGTGAAATCAAGAAATGTCTTGGAGTAAATGTAAAAGTAAATAAATGTGAGAAAACAATATCAATAAGTCAGGAAGATTATATTGATCAGCTGTTACTTAAATTCAAAATGAGTCAATGTAAAACTGTTCAAACTCCAATGGAGACTAAGTTACATGCATCTAAAGATGAGAATAATGTAGATAAGTTATTGTTTCCTTATCAACAAATGATAGGTTCTTTAATGTATTTAGCGGTTCTTACAAGGCCAGACATTGCATTTGCAGTTAGCTTTTTAAGTCAATTTAATAATTCCTATACCAAACAGCATTGTTCATATGTAAAACGCATATTGCGATATTTAAAATTGACCAAACATTATGGTTTAAAATTTTCTGCAGATGGGAACTCTGTCATTGAAGGATTTGTAGATGCTGATTGGGGTGGGAACACTATTGATAGAAGATCCTACACGGGTTTCTGTTTCACTTTGTCAGGTTGTGTAATTTCTTGGGAGACAAAGAAACAGAAGACCGTGGCTTTATCAAGCAGTGAAGCCGAGTACATGGCTTTAACTGAAGCATGTAAGGAATCTCTTTATTTAAGAAATTTACAGTTTGAAATAACTAATAAGAAGTACACTATTGAATTATATAATGATAACCAGAGTGCATTAAAGTTAACTCAAAATCCAATCTTTCATAAGAGAAGCAAACACATTGACATACGTTATCATTTTTCTAGAGAATGTGTAAATAATAATATTGTGAATGTTAAATATTTACCATCAGCTGAGATGCCAGCTGACTTACTTACAAAGAGCTTGTGCTCTAATAAGCATTATTATTTATTGGATAAGTTGGGGGTTCAGCACATATGTTAA

Protein sequence:

>DPOGS215350-PA
MAANYLVNVPKLRGRENYSEWSFAAENFLILEGMKHCVKPEGAVVGAADDEKTRAKLIMTIDPSLFVHVKSVRTTKELWDKLQQLFDDNDMESTIEERDEVGGAFAARSNSKYKKNKMASRKNVNVGSTADTSKSNVTCYRCKQKGHYRNQCTNNENNASNFKEKPRMQSNAFSAVFLSGNFSKNAWYIDSGASVHLTANESLVMNASYDQKQEIIVANSEKLSVLCSGDVKIITTTGDIDYEIMVEDVYCVPSLATNLLSVSQLISKGNKMEVSSQSFDSVGEEEVNIDEPNSESDHTDSSEDTFLDVVDETYKPSDSEAEDIPQIRPQRPTRERKQPDRFKCSNFCAGESTYDDVTGLSLQDALAGPEKEQWKIAMAEELQSFKENDAWEISNPPQDVRVVKCKWVLRKKYDCDNNIRFRARLVAKGFSQVQGVDYTDTFSPVVRHTTLRLLFALSVQLNLDITHLDVTTAFLYGILEEDIYIQIPEGFSEKVEKGQVLKLKKSMYGLKQSSRVWYKRVEECLLKIGFVKSKIEPCMFLKTQDKLKTIVTLYVDDFFIFSNDIIATKHLKDVLSDNFKIKDLGEIKKCLGVNVKVNKCEKTISISQEDYIDQLLLKFKMSQCKTVQTPMETKLHASKDENNVDKLLFPYQQMIGSLMYLAVLTRPDIAFAVSFLSQFNNSYTKQHCSYVKRILRYLKLTKHYGLKFSADGNSVIEGFVDADWGGNTIDRRSYTGFCFTLSGCVISWETKKQKTVALSSSEAEYMALTEACKESLYLRNLQFEITNKKYTIELYNDNQSALKLTQNPIFHKRSKHIDIRYHFSRECVNNNIVNVKYLPSAEMPADLLTKSLCSNKHYYLLDKLGVQHIC-