Monarch geneset OGS2.0

DPOGS200825
TranscriptDPOGS200825-TA1338 bp
ProteinDPOGS200825-PA445 aa
Genomic positionDPSCF300071 - 588898-598164
RNAseq coverage4077x (Rank: top 3%)
Annotation
HeliconiusHMEL0114730.089.87% 
BombyxBGIBMGA009879-TA1e-16293.85% 
DrosophilaCG13124-PE3e-7045.52% 
EBI UniRef50UniRef50_E3XBP96e-8045.59%Putative uncharacterized protein n=2 Tax=Endopterygota RepID=E3XBP9_ANODA
NCBI RefSeqXP_001652757.12e-8847.96%hypothetical protein AaeL_AAEL007452 [Aedes aegypti]
NCBI nr blastpgi|1571160784e-8747.96%hypothetical protein AaeL_AAEL007452 [Aedes aegypti]
NCBI nr blastxgi|1582935444e-8648.42%AGAP008763-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00160701.3e-32RNA metabolic process
GO:00054888.2e-17binding
GO:00055157.2e-16protein binding
KEGG pathway 
InterPro domain[225-442] IPR0160211.3e-32MIF4-like, type 1/2/3
[227-431] IPR0160248.2e-17Armadillo-type fold
[228-428] IPR0038907.2e-16MIF4G-like, type 3
Orthology groupMCL15931 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200825-TA
ATGTCGCGTTGTATCGTGAAGCAGTCCGTGTCGCAGCCGGTAGCGAATGTGGCCCAAGTGTCGCCGGTGTCCGTGTCGCCAGTGTCCATAGCGCCCTTAGCGGCCGTGGCCAGTGTGTCGCCCGTGGCTTCAGCTCCAGTTACACCTTTGACACCAGATGTCACAATCACAGAATCTTTATCAGACCCCGCAGGTGTCCGCACAGACGGCGTGGCGACCACGGCGGCCACGACGCCCACCACGCCCACGGCTGGGAACAAGCTGAACGTGCACGCAAAGGAGTTCACTATGGCCAAGCCGACAGACCTTCATAATAGGTCGTCTGTAGGCCTGGGGTACACGAGTGTGGGTCTGCAGCACTCGCGGTCGGTGGTGCTGCACGCGCAGGTGCCGCCACACTTCAGACCGGTGATGCCGCCGCACGCGTTACTCACGAGCGCCTCTAGCGGCAATGTGCCGCACGCCAACTCGGGCCCTCGGGTTCATTTTAAACTCCAGCCGCAGCAGATCATTCAGGAGAAGAAGAAGTCACCGCCGTCCTCGCTGTCGAACGGCAACGGGAAGAGCATCAGGCCGCAGTCGTTTACGCCGGGGCTGAAGCGCTCCAAGTCTCTGACCACGGCTGACACGCTGGCCAGTGGCATGGCAGCTCTGGGCCTGGCCGCTGACGCCGGGGACCTTGGCAACTTCCCCCCCGAGATCCAGGAATGCATTGACAAGGCACTGGAAGACCCGAACGCCGTGTGCGCTCGTACTCTGATGGATGCCGTGGGTCACCTGATCTCCCGCGCGGTGGAGTCTCCGCGGTACGCGCTACCAGCAGCTCGGCGATGTATCGCGGTAGTGGAGAGAGAAAACACGGAGACGTTCCTCGAGTCTCTGTTGAACACGTGCCAGCAGTGGTACCACGACAGGGATAAGTTGCTGGGTGCTGTGGTGAGCGGCGGCCGTCCTCGTCTGATGGCCTTCCTGTCGTTCTTGCTGGAGATGTACTGCCAATTGCGGCGGCGGGCCATCCAGCGCCGCGGACACAGCGCGCCGGGACACGTGCTGCTCGCGCTCATCTGCAAGTGCTGTGAGGACTGCATCCGACAACCGGTGCCTTCACCCAGCGACACGGAGAATCTGTTCTTTGTCCTGACGTACATTGGCCGCGATCTCGAAACGCAGCTGCCAGGTGACCTGGAGCGCCTGCTGTCGGCCGTCCGTGACGCGTTCATCAACACGGCCGCCGCCCCCTCCATACGACGCACTCTACTCCAGCTGATCGAGCTCCACGCGTCTCGCTGGCAGCTGCCCGGCTGTGCGGTGCTCTACTACTACCCCTCCTCCAAGTAG

Protein sequence:

>DPOGS200825-PA
MSRCIVKQSVSQPVANVAQVSPVSVSPVSIAPLAAVASVSPVASAPVTPLTPDVTITESLSDPAGVRTDGVATTAATTPTTPTAGNKLNVHAKEFTMAKPTDLHNRSSVGLGYTSVGLQHSRSVVLHAQVPPHFRPVMPPHALLTSASSGNVPHANSGPRVHFKLQPQQIIQEKKKSPPSSLSNGNGKSIRPQSFTPGLKRSKSLTTADTLASGMAALGLAADAGDLGNFPPEIQECIDKALEDPNAVCARTLMDAVGHLISRAVESPRYALPAARRCIAVVERENTETFLESLLNTCQQWYHDRDKLLGAVVSGGRPRLMAFLSFLLEMYCQLRRRAIQRRGHSAPGHVLLALICKCCEDCIRQPVPSPSDTENLFFVLTYIGRDLETQLPGDLERLLSAVRDAFINTAAAPSIRRTLLQLIELHASRWQLPGCAVLYYYPSSK-