Monarch geneset OGS2.0

DPOGS211402
Transcript: DPOGS211402-TA (1542 bp)
Protein: DPOGS211402-PA (513 aa)
Genomic position: DPSCF300115 - 166677-168552
RNAseq coverage: 19x (Rank: top 80%)
Annotation
Heliconius: HMEL009859 | 0.0 | 75.38%
Bombyx: BGIBMGA010862-TA | 0.0 | 71.52%
Drosophila: CG17838-PB | 8e-78 | 48.25%
EBI UniRef50: UniRef50_UPI0001CBAAE5 | 1e-101 | 58.47% | UPI0001CBAAE5 related cluster n=3 Tax=unknown RepID=UPI0001CBAAE5
NCBI RefSeq: XP_002736660.1 | 2e-102 | 58.47% | PREDICTED: apobec-1 complementation factor-like [Saccoglossus kowalevskii]
NCBI nr blastp: gi|47209668 | 2e-105 | 58.86% | unnamed protein product [Tetraodon nigroviridis]
NCBI nr blastx: gi|327274033 | 6e-105 | 43.37% | PREDICTED: probable RNA-binding protein 46-like [Anolis carolinensis]
Group
Gene Ontology: GO:0000166 | 8.4e-19 | nucleotide binding
Gene Ontology: GO:0003676 | 2.1e-16 | nucleic acid binding
KEGG pathway:
InterPro domain: [10-315] IPR006535 | 5.5e-139 | HnRNP R/Q splicing factor
InterPro domain: [26-116] IPR012677 | 8.4e-19 | Nucleotide-binding, alpha-beta plait
InterPro domain: [49-114] IPR000504 | 2.1e-16 | RNA recognition motif domain
Orthology group: MCL20530 | Patchy

Nucleotide sequence:

>DPOGS211402-TA
ATGGTGGAGACTAACAACCAGATGTCTTACAAACTGTTAGAGCTGATGCAGAGGACCGGCTACACTATCGCACAGAGAAACGGACAAAGGAAGTTCGGCCCACCGCCCAACTGGACCGGACCGCCGCCGCCCAGAGGATGCGAAGTGTTCGTCGGTAAGCTGCCAAGAGAAATATTCGAAGACGAGCTCGTCCCGCTGTTCTCGAGAGTCGGCAAGATATACGAGATGCGACTCATGATGGATTTCTCCGGCTCGAACCGCGGGTACGCATTCGTAATGTACACTGAACGGGCAGAGGCCACGGCGGCCGTGAAGCAACTCAACGGCTACGAGATACGCCCGCGACGACACATCGGAGTCGTAAAGTCCGTGGATAACTGTCGCCTGTTCGTAGGAAACATTCCCAAGACGAAGACCAAGGAGGACGTCAGGGAGGAGCTCTCCAAGCGCGTATCGGACATTGTGGACGTCATACTGTACAAGAACTGTTTTGATCGAAAGCTGAACCGGGGCTTCGCCTTCGTGGAGTTCACGTGTCACCGCGCAGCCGCTATGGCGAGGCGGGCGCTGGTGCCGGGCTGCGTGAGACTCTGGGACCAGGAAGTCATGGTGGATTGGGCTGAACCAGAACCTGACATAGACGACGAGCAGATGCAAAGGGTGAAGGTACTGTATGTGCGTAATTTCGAGATTCGGACGACCCCGGACGTGATCCAAAAGGTTTTCGAATCCACCATTAACCACAAAGTGGAACGAGTAAAAAAAATATATGACTATGCGTTCATACATTTTTACGAAAGAGAACACGCAGAGCTTGCTATAGCGAAATTACAAAACGCAGACATCGACGGTAGTAACATTGAAATAAGATGGGCCAAGCCCGTCGACCGCGACCTGTACCGCATACAGAAACTCAGTAGAGGCAATGCGAAGTTTAACAATAGCTTGGATTTGACTCAAACCCTGTTGCTTTATAAACATCACATCGAAAAGCAGGAGTACGCAAACGGTCCCAAGGACGAAGGCATCGGCTCGGCCTGCGCTGGGGGCAGTTCCTGTTGCTCTCCGACAGACGCAAAGCCTCCAGTGTATTATCAGGCACCAGCTAACTATATTCTGGCAACAGTTAAGCTGGAATCCATGTGCAAACGCTACATGTGGTCTCCACCAGTATATGACTATCAAAAGTATGTGGATCCTACTGGCACAGAACTATGGGTGTGCCGTGTGGAGCTGCCTCAAGTGGGTCTGCCGCTGTTGTCTGTTGCGCAACGTGTGGGTCCCCTTGCAAGCCGAGCATGCTTCTCTCTTGAACAAGCTCATGTCGAGGCAGCAGAATTGGCCCTGCAGGCCTTGAAGATGTTGAGAGTGGACCTGATCCACCAGTCTCCTGGTTCCCTGTACTCCATGCCTGCAGCTTGTGTCGGTCTCCCTTGTGACTACCCGGTGTACTCGCTGGCGCCTGCTATTCCTGCTACACTACCTGGTTCCTTGCCAGCTCCACTTTCTGCTCCAATACCAACTATTTGGCGTACTGTTTAG

Protein sequence:

>DPOGS211402-PA
MVETNNQMSYKLLELMQRTGYTIAQRNGQRKFGPPPNWTGPPPPRGCEVFVGKLPREIFEDELVPLFSRVGKIYEMRLMMDFSGSNRGYAFVMYTERAEATAAVKQLNGYEIRPRRHIGVVKSVDNCRLFVGNIPKTKTKEDVREELSKRVSDIVDVILYKNCFDRKLNRGFAFVEFTCHRAAAMARRALVPGCVRLWDQEVMVDWAEPEPDIDDEQMQRVKVLYVRNFEIRTTPDVIQKVFESTINHKVERVKKIYDYAFIHFYEREHAELAIAKLQNADIDGSNIEIRWAKPVDRDLYRIQKLSRGNAKFNNSLDLTQTLLLYKHHIEKQEYANGPKDEGIGSACAGGSSCCSPTDAKPPVYYQAPANYILATVKLESMCKRYMWSPPVYDYQKYVDPTGTELWVCRVELPQVGLPLLSVAQRVGPLASRACFSLEQAHVEAAELALQALKMLRVDLIHQSPGSLYSMPAACVGLPCDYPVYSLAPAIPATLPGSLPAPLSAPIPTIWRTV-
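
Consistency check (sketch):

The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon. The function names `translate` and `expected_transcript_length` are hypothetical helpers introduced for illustration; they are not part of the geneset or of any library.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" by default, matching the
    convention used in the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_transcript_length(protein_length):
    """Length in bp of a CDS encoding `protein_length` residues plus one stop codon."""
    return 3 * (protein_length + 1)


# First seven codons and last five codons of DPOGS211402-TA, copied from the record.
print(translate("ATGGTGGAGACTAACAACCAG"))   # start of the protein
print(translate("TGGCGTACTGTTTAG"))         # end of the protein, including stop
print(expected_transcript_length(513))      # compare with the 1542 bp transcript
```

Passing the full DPOGS211402-TA sequence to `translate` and comparing the result with the DPOGS211402-PA record extends the same check to the whole gene.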