Monarch geneset OGS2.0

DPOGS211285
TranscriptDPOGS211285-TA1365 bp
ProteinDPOGS211285-PA454 aa
Genomic positionDPSCF300161 + 247444-254684
RNAseq coverage1360x (Rank: top 9%)
Annotation
HeliconiusHMEL0026580.093.42% 
BombyxBGIBMGA012503-TA9e-18077.53% 
Drosophilamub-PK3e-13870.57% 
EBI UniRef50UniRef50_UPI0001CBB8FE2e-9154.55%UPI0001CBB8FE related cluster n=1 Tax=unknown RepID=UPI0001CBB8FE
NCBI RefSeqXP_969611.27e-15971.16%PREDICTED: similar to AGAP004942-PA [Tribolium castaneum]
NCBI nr blastpgi|3838624936e-16569.62%PREDICTED: poly(rC)-binding protein 4-like isoform 1 [Megachile rotundata]
NCBI nr blastxgi|3838624931e-17871.03%PREDICTED: poly(rC)-binding protein 4-like isoform 1 [Megachile rotundata]
Group
Gene OntologyGO:00037231.4e-18RNA binding
KEGG pathway 
InterPro domain[303-367] IPR0181111.4e-18K Homology, type 1, subgroup
[299-372] IPR0040871e-17K Homology
Orthology groupMCL11705 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211285-TA
ATGGAACTGGAGAAAGCCCCCACTCTTCATGAGGAGGACTCAAAAGTTACCCTCACTATCAGACTGATTATGCAGGGGAAGGAAGTTGGCAGTATTATTGGAAAGAAGGGGGAAATTGTCAAAAGATTCAGAGAGGAGTCGGGTGCGAAAATCAACATATCAGATGGATCATGTCCAGAAAGAATTGTCACTGTCACGGGGAACACCAGTTCAATATTCAAAGCATTCACACTTATATGCAAAAAATTCGAAGAGTGGTGCTCGCAGTTCAATGAGGGCGGTGGCGGATCCCGGGCTCCGATCACATTACGGCTTATCGTGCCAGCATCACAATGTGGTTCACTCATCGGCAAAGGCGGTTCTAAGATCAAGGAGATCAGAGATGTTACCGGCCGCTTGTATGTCCATGTGGCAAGTGAGATGTTACCGAACTCAACGGAGCGTGCGGTCACCATCAGTGGTACATGTGATGCGATCACACAATGCATCTATCACATCTGCTGTGTAATGCTAGAGAGTCCTCCAAAAGGCGCGACAATCCCATACAGGCCGAAGCCCAATGTAGCAGGTCCAGTGATACTGGCCGGTGGACAAGCTTACACCATACAAGGAAACTATGCTGTGCCAGCACAAGATGCGGTGTGCGCGCCGGTGTTCCCGATGCTCGAGGTGAAGCCGCCGCTAGTGGGAGCCTTGCCGCCGGCTCACCTCCTGCCGCCGCTGGACCACCACCTCATGGGCGGTTTGGCTAAATCACCACTGGCCGGCTTAGCAGCCCTAGGTCTCGGCGGATTGGGACCCGCTAACACTGGCGGACTTAACCCAGCTGCGCTAGCAGCGCTGGCCGGTTCTCAGTTGAGGTCGTCTAACACTGGCCGCAACCAACCGGCGACCAACCAACAGTCACACGAGATGACCGTGCCCAATGAACTTATCGGATGCATTATTGGAAAGGGAGGCACCAAAATAGCTGAGATCCGTCAAATATCCGGCGCCATGATCCGGATATCGAACTGCGAGGAGCGCGAGGGAGGCAGCACGGACCGCACAATCACCATATCAGGCAACCCGGACTCCGTGGCGTTGGCGCAATACCTCATCAACATGAGTGTGGAGCTGCAGAAGGCGAATCTGGAGGGTGGCGGGTCAGGGGGGCCCCTCGCGTCCGCTATCCCGCTGGCGCAGTTGCTCAGCAAGAGCGGGGCCCTGGGGGCCCTCGGCTCGCTGTCCGCGCTCGGGGGCCTCACGGACCTCCTGGCGGGGGGGCCCGTGCAAACGACCGGGGTACACCGCCCGCACAAGCTCACACGTCGCGACGACGACGCCAAAGGATCCAAGGATAGAAACAAATATAGTCCCTACTAA

Protein sequence:

>DPOGS211285-PA
MELEKAPTLHEEDSKVTLTIRLIMQGKEVGSIIGKKGEIVKRFREESGAKINISDGSCPERIVTVTGNTSSIFKAFTLICKKFEEWCSQFNEGGGGSRAPITLRLIVPASQCGSLIGKGGSKIKEIRDVTGRLYVHVASEMLPNSTERAVTISGTCDAITQCIYHICCVMLESPPKGATIPYRPKPNVAGPVILAGGQAYTIQGNYAVPAQDAVCAPVFPMLEVKPPLVGALPPAHLLPPLDHHLMGGLAKSPLAGLAALGLGGLGPANTGGLNPAALAALAGSQLRSSNTGRNQPATNQQSHEMTVPNELIGCIIGKGGTKIAEIRQISGAMIRISNCEEREGGSTDRTITISGNPDSVALAQYLINMSVELQKANLEGGGSGGPLASAIPLAQLLSKSGALGALGSLSALGGLTDLLAGGPVQTTGVHRPHKLTRRDDDAKGSKDRNKYSPY-