Monarch geneset OGS2.0

DPOGS208793
TranscriptDPOGS208793-TA1182 bp
ProteinDPOGS208793-PA393 aa
Genomic positionDPSCF300036 - 462272-463699
RNAseq coverage58x (Rank: top 69%)
Annotation
HeliconiusHMEL0049158e-15466.84% 
BombyxBGIBMGA007654-TA5e-15865.39% 
Drosophila% 
EBI UniRef50UniRef50_UPI00022472BB5e-2829.48%UPI00022472BB related cluster n=1 Tax=unknown RepID=UPI00022472BB
NCBI RefSeqXP_001604001.13e-2530.71%PREDICTED: similar to LOC553388 protein [Nasonia vitripennis]
NCBI nr blastpgi|3454913362e-2729.48%PREDICTED: rhabdoid tumor deletion region protein 1-like [Nasonia vitripennis]
NCBI nr blastxgi|3454913365e-2929.45%PREDICTED: rhabdoid tumor deletion region protein 1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00054881.7e-24binding
KEGG pathway 
InterPro domain[20-305] IPR0160241.7e-24Armadillo-type fold
[40-215] IPR0119891.4e-18Armadillo-like helical
Orthology groupMCL20465 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208793-TA
ATGCAACCGTCCCTTTATGCACCCCATGTGGATATAACCAGAGCGTCCTTAGGTTTTGAACGTGTTGGTCTTAGACTAATTAACAGGGACTTACATTCGCCCGATCATTTGAAGCAATTACAGGCAATCCATAGTATTTTAGACCAGGTACAAATTTCAGAAAACGCTTTGTTTCTCATAGATCTCCAAGTTGTCTATCGACTGATTGACCTGATGGTGCATAAAAATCCGGTGATACGGGAAAAGGTCTGCATCATACTATCATCGCTATGTAACTTTTATCAAGGAAGGAAGCAGATGATGACAAAACTCTCCGTCGTCGAGAATCTCATTTGGCTGATAATGCGAGACCGCAGAGAGATAAGGTACGCAGCAGCATACACCTTGAGGTGTTTAGCCAGGGACAGATGTTCCAGCGAATACATTCTACAAGACGAAAAGATTATTGAAAATTTGCTTAAAATGATAAAACACGAACACGTGGGCATCGTCGTATTGCATTTGAAGACGTTGGAACATCTCTTCGAATGGGACCAAGAGAGGCCGCTGAAAGCGAACGCCTTCAGACTTATGGTGAAGTTGTTCGAAAGCAAAGATCCTAGAATAGTGAGCGGTGCGATGGATTGCTTGACGCAGCTGTGCAAACACGATGTAGGGAAAAAGGTAGCCGACGTGTACGACTTGACGTTCACGCTGAAACCTTTCCTGATGTCGTCGGCCCTGGAAATCAAAATAAGCGCAGTCGGCCTCATGGAGTACACAACCGTGACGACCCGCTCCAAATGGAGGGCGAAGGAATGTTGTGTGGACCTGACCAAGCGTCTAGTGGTGTTGTGCCACTGCCCAAACATACCTCTCCTTCAATTGAGGAGCATGCAAGTACTGATCAATCTCTGCGACTGTCCCGACATCAGACATCACATAAAGATGCACTGGGAGAAGAAAATTGAGGCCATAAAAATACGAAGCCACGAACAATGGGACGGCACTTCGGAGACCACCAGCTACTGCTTCGAGACCGGCCATAACTATAGAACCATGTGTGTCGAGGGAGTTGAAACCATAAAAAACGATTTTGGGGACAACGCACACGTCGTCAACGTTCACAGCTATTTAAGACGTCTGCATGAAAAGAAATCGCAGCTCCTCTACGCTATCAACTGGAAGTCCTACAGAGATTAA

Protein sequence:

>DPOGS208793-PA
MQPSLYAPHVDITRASLGFERVGLRLINRDLHSPDHLKQLQAIHSILDQVQISENALFLIDLQVVYRLIDLMVHKNPVIREKVCIILSSLCNFYQGRKQMMTKLSVVENLIWLIMRDRREIRYAAAYTLRCLARDRCSSEYILQDEKIIENLLKMIKHEHVGIVVLHLKTLEHLFEWDQERPLKANAFRLMVKLFESKDPRIVSGAMDCLTQLCKHDVGKKVADVYDLTFTLKPFLMSSALEIKISAVGLMEYTTVTTRSKWRAKECCVDLTKRLVVLCHCPNIPLLQLRSMQVLINLCDCPDIRHHIKMHWEKKIEAIKIRSHEQWDGTSETTSYCFETGHNYRTMCVEGVETIKNDFGDNAHVVNVHSYLRRLHEKKSQLLYAINWKSYRD-