Monarch geneset OGS2.0

DPOGS211147
TranscriptDPOGS211147-TA1476 bp
ProteinDPOGS211147-PA491 aa
Genomic positionDPSCF300007 - 90018-94778
RNAseq coverage1510x (Rank: top 8%)
Annotation
HeliconiusHMEL0172021e-16083.14% 
BombyxBGIBMGA003020-TA0.081.80% 
DrosophilaCG2950-PD6e-6949.36% 
EBI UniRef50UniRef50_E9ICW25e-10757.56%Putative uncharacterized protein (Fragment) n=8 Tax=Formicidae RepID=E9ICW2_SOLIN
NCBI RefSeqXP_392911.26e-10656.30%PREDICTED: similar to CG2950-PB, isoform B isoform 1 [Apis mellifera]
NCBI nr blastpgi|3072059709e-10757.22%hypothetical protein EAI_00513 [Harpegnathos saltator]
NCBI nr blastxgi|3072059709e-10558.29%hypothetical protein EAI_00513 [Harpegnathos saltator]
Group
Gene OntologyGO:00160704e-15RNA metabolic process
GO:00037237.8e-09RNA binding
GO:00054889.1e-08binding
KEGG pathway 
InterPro domain[50-132] IPR0160214e-15MIF4-like, type 1/2/3
[194-255] IPR0181117.8e-09K Homology, type 1, subgroup
[189-260] IPR0040872.2e-08K Homology
[40-121] IPR0160249.1e-08Armadillo-type fold
Orthology groupMCL16109 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211147-TA
ATGACCCGCACGGTTAAGAAACTTGAAGTGCCTAGACCTTTGAAATCAACCACCAGTTCCGCTCATAATCGCAACTCAGTATTGGATGTAAAAATTAATTCTGTTGATGACCTAATTGTTTTGACGGAAGCCGTGGCATATCAAATGCTGCAAGGAAATTTTGACCGTGCCCTTCAGAGTAATGTTGCCACAATGTATTCTAATCTCAAGCTTTATGGGGCTCAGCTCGAAGCACTGTATAAGGATTTTCTTGACAGATATTTTGTTGTTTTCCGTAACGGATCCCAAGATGAACGCCTCGATAAGAAAACTCGTCTTCATTTACTTGAACTAATAGAATTGCGTGCTAAACACTGGCAGGGTTCAGACTACATGAGTCAGTATTATAGACATCGTGGGACACATGCGGAGCCACTGCTGGTGCCCACTATGGAGGGGTCAGGGTCTGGTGGCGCGGCATCCCCTACGCTGGCTGCGCCCGCCGAGCCGCCCACACTGCTGGCCCCCGGCGAACTCATTAAACCCTCGGGCAAGTTCCCCAAGCCAACCAAGATCCCCGGCAAAAACTACTCCAAGGATGAGGTCGTCATTCGCAATGCTGACTCTGGAAAAGTGATGGGTATCAAGGGCAGGCGTGTGCACATGATTGAGGAGCTAAGTGATACGATTATATCATTTCAGAGAGTGAGTCCCGGGGCGAAAGAGCGTCTGGTGCAAATCACTGGGCCGAATGAGGAGAATGTCAATCATGCGAAGCACCTGATAGGCGACACCATCCGTCGCAACGCCTCCCCCGTCCGGCTTGAGGGTACTCTGGAACAACGAGCGCCCTCCCGCGCCTCCATAGACTCTAACGGATCGGACGACGCGCGCCCCAGGGAGAAGTCCCCTCACAATGGCAACAGAACTCTCCTCCATAGTTTCTCGACGAATGACGCAGCTTTAGGAGAATACAAGTATACCGTCACGTTTGGTCAACATTCCATCAAAATCACCGGGAACAATCTGGATCTGGTCAAGACGGCAAAGCTGGTGTTGGACGAGTATTTCGAGAGTGCGGGGGCGTTGGAGGCGATGGGGTTGGGGTCGGGGGACTTCTTCACCTTGTCCCAGAGACCGGCGGCCGCGCTGCTGCTGCGAGATGACGCCGCGCTCAACGGAGCTGACGATGACGTGTTCGCCACGGACGCGCGGCCCGAAGAAGCCACTGCAAATGGTATATGGTCCGTTTGTGCAGTGGACGAGAGCGAGACCGTGCCACGCCGCCCACGTTTCTCGCGCGCCACCTCCACCGATAAGGGCGCAGCCCGCCCCGCACCACGCTCCGCCCGCCCCGCACCCCGCGCCCTGCCGCCTCTGGCCGCCGCTGCACCACCCCAACTACTTGGCGGCACTCCTCTGCCGCAACTTCTACTAGCCCCTCCTCCTCCCCTCCTCCCCCACACTCCACTCCCCCCTCACATCATCGCCGCTTGA

Protein sequence:

>DPOGS211147-PA
MTRTVKKLEVPRPLKSTTSSAHNRNSVLDVKINSVDDLIVLTEAVAYQMLQGNFDRALQSNVATMYSNLKLYGAQLEALYKDFLDRYFVVFRNGSQDERLDKKTRLHLLELIELRAKHWQGSDYMSQYYRHRGTHAEPLLVPTMEGSGSGGAASPTLAAPAEPPTLLAPGELIKPSGKFPKPTKIPGKNYSKDEVVIRNADSGKVMGIKGRRVHMIEELSDTIISFQRVSPGAKERLVQITGPNEENVNHAKHLIGDTIRRNASPVRLEGTLEQRAPSRASIDSNGSDDARPREKSPHNGNRTLLHSFSTNDAALGEYKYTVTFGQHSIKITGNNLDLVKTAKLVLDEYFESAGALEAMGLGSGDFFTLSQRPAAALLLRDDAALNGADDDVFATDARPEEATANGIWSVCAVDESETVPRRPRFSRATSTDKGAARPAPRSARPAPRALPPLAAAAPPQLLGGTPLPQLLLAPPPPLLPHTPLPPHIIAA-