Monarch geneset OGS2.0

DPOGS202463
Transcript: DPOGS202463-TA; 1191 bp
Protein: DPOGS202463-PA; 396 aa
Genomic position: DPSCF300174 + 293142-298309
RNAseq coverage: 1915x (Rank: top 6%)
Annotation
Heliconius: HMEL016403; 5e-169; 74.82%
Bombyx: BGIBMGA009981-TA; 7e-163; 72.14%
Drosophila: CG8963-PC; 4e-22; 28.03%
EBI UniRef50: UniRef50_E9GHY6; 4e-27; 31.15%; Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9GHY6_DAPPU
NCBI RefSeq: XP_002138474.1; 1e-24; 30.50%; GA24791 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp: gi|321469990; 2e-26; 31.15%; hypothetical protein DAPPUDRAFT_102922 [Daphnia pulex]
NCBI nr blastx: gi|321469990; 3e-27; 29.79%; hypothetical protein DAPPUDRAFT_102922 [Daphnia pulex]
Group
Gene Ontology: GO:0016070; 2.6e-18; RNA metabolic process
Gene Ontology: GO:0005488; 5e-13; binding
Gene Ontology: GO:0005515; 1.2e-05; protein binding
KEGG pathway:
InterPro domain: [120-307] IPR016021; 2.6e-18; MIF4-like, type 1/2/3
InterPro domain: [86-306] IPR016024; 5e-13; Armadillo-type fold
Orthology group: MCL16329; Insect specific

Nucleotide sequence:

>DPOGS202463-TA
ATGAACAACGGGGACGTCGGAGGCTCTAGAGGCAGAGGCCGTGGTTGGAATTCCGACAACCAACCTCGCGAATTACGCCGGCCCAAAACTGTAACTGAGGAAGTGAAAGAACCTCCAAAGTCAATACTGTCAGCTGAAGCGAAAGAATGGTATCCGCGGAACTACGTGCCCCAGAACCAAGTGTCGTATGGGCAGGAGCACTACCGAGTGCCTCGCTATTCCGCTCAAGATAGAATCCGACAGGCTCAGGAACAGGATCCGTACAACTTTGAGGATATGTCATATTCTTTGGATGAACCCGAAAGTGCCTTACGGGAAAATATAGCAAACCTGATTACTGTTATGTGTGAGATAACATTTGACCCAGGCAAGTTTGACACTCTCTGTGGACCTCTGGTAGATTCATTTTATGCCACTCTACATGATGCTAACTACACCAGGCCTCTTGTCGAAGCTATCGTGAATCAGTCAATATTCGAGGCCAACTTCCGCTACAATGGCGCCCGTCTTTGCTCGATGTACGACTCCGTCTCGCCTCCCGAAGACTCAACATTCCGAGCCTGTCTGTTGGAACGTTGTACTGCCGAGGAGAACAAAATCATAAGTGGGGCAGAAACATCGGAGGAGAACGTCAGAGGTTTTGCTATGTTCCTGGCTGAGATATACACACAGCTGGAGGACAATCAGGGAGGAAGAATAAGGACTCTGGGTGAAAGTCTCTGTAAAGTGTTCTTGCATCTTTTGGACACCGACAAAGAGGTCAACATAAAAGCGGTATGCCAGTTGTTGAAATTGTCCGGTATAGCGCTGGACGCGGATTGTCCGTCTAGCATGCAGCAGCTGTTCGATCGCTTGAAGCAACGTTCGGATCTGGCGAGCGTGCGTCACGTGGTGTCGCTGAGGGCAACCCGTTGGGGTCTGGCTGACCCCGACCCGCCGGCCCCGCCCGCTGACAGACGACGAAACGCTAACTCCGAAGCCGACGGTGTTGGAGGTTATCTCGCAGACGGACATTCGCTAACTGCCGAGGAATGCGCCTTCTTGCAAAGCAACCTACCCCCAAAACCCGCGGCTATAGAGGACGACATACTTGAGGAATTGGAAAATGATGCATGGGATACTGGCATGGATCCGGAAATGCAGGCGGGCTTCCTAGAGTTCCTCAAGATATCCAATCAAATCAAACGATAG

Protein sequence:

>DPOGS202463-PA
MNNGDVGGSRGRGRGWNSDNQPRELRRPKTVTEEVKEPPKSILSAEAKEWYPRNYVPQNQVSYGQEHYRVPRYSAQDRIRQAQEQDPYNFEDMSYSLDEPESALRENIANLITVMCEITFDPGKFDTLCGPLVDSFYATLHDANYTRPLVEAIVNQSIFEANFRYNGARLCSMYDSVSPPEDSTFRACLLERCTAEENKIISGAETSEENVRGFAMFLAEIYTQLEDNQGGRIRTLGESLCKVFLHLLDTDKEVNIKAVCQLLKLSGIALDADCPSSMQQLFDRLKQRSDLASVRHVVSLRATRWGLADPDPPAPPADRRRNANSEADGVGGYLADGHSLTAEECAFLQSNLPPKPAAIEDDILEELENDAWDTGMDPEMQAGFLEFLKISNQIKR-
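
The transcript and protein lengths in this record are consistent with each other if the transcript is read as a coding sequence ending in a stop codon: 1191 bp is 397 codons, which is 396 residues plus one stop. The Python sketch below illustrates that check and translates the first and last eight codons of DPOGS202463-TA with the standard genetic code. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration, not part of any MonarchBase tooling, and the choice of "-" as the stop symbol simply mirrors the trailing "-" in the protein sequence above.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count, assuming the transcript is CDS only and ends in a stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First and last 24 nt of DPOGS202463-TA, copied from the record above.
first_codons = "ATGAACAACGGGGACGTCGGAGGC"
last_codons = "ATATCCAATCAAATCAAACGATAG"

print(expected_protein_length(1191))  # 396, matching the listed protein length
print(translate(first_codons))        # MNNGDVGG
print(translate(last_codons))         # ISNQIKR-
```

The two translated fragments match the start (MNNGDVGG) and end (ISNQIKR-) of DPOGS202463-PA. Passing the full nucleotide sequence to `translate` would be the natural next step for checking the whole record.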