Monarch geneset OGS2.0

DPOGS204840
Transcript: DPOGS204840-TA (1722 bp)
Protein: DPOGS204840-PA (573 aa)
Genomic position: DPSCF300227 - 197870-202282
RNAseq coverage: 205x (Rank: top 46%)
Annotation
Heliconius: HMEL013896  1e-112  68.79%
Bombyx: BGIBMGA011735-TA  0.0  68.07%
Drosophila: CG10948-PC  5e-70  42.39%
EBI UniRef50: UniRef50_E0VIU7  2e-95  39.00%  Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VIU7_PEDHC
NCBI RefSeq: XP_002426041.1  3e-96  39.00%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242010576  6e-95  39.00%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|91092928  2e-105  40.24%  PREDICTED: similar to ecto-NOX disulfide-thiol exchanger 2 [Tribolium castaneum]
Group
Gene Ontology: GO:0000166  1.3e-09  nucleotide binding
               GO:0003676  1.7e-08  nucleic acid binding
KEGG pathway
InterPro domain: [106-184]  IPR012677  1.3e-09  Nucleotide-binding, alpha-beta plait
                 [122-175]  IPR000504  1.7e-08  RNA recognition motif domain
Orthology group: MCL11019  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204840-TA
ATGATGATGAGTGGCATGTTCAATATGATGATGCCCAATCCAATGATGGCTGGAGGGATGATGCCTTCGAGCGGAATGGAGATGATGCAGAGCACCAGTATGGAGATTATTTCACAGCCACCAATAGACATAACCGCTATGGGAACACAACCCTTAGCCGCTCCGCCTTTGACACCATCAATGGACATGAGTATGATGGGTGGTGTTATGATGGATCCATCCATTATGGGTATGTTTCCAAACATGAGTAACGAAATTGTGCCAGAGAAGAAGGAAATTGTCCTTAAGCATAGTAAGCTAGTGGCACCAGCCCCAGGAACCCCACAACCTCCTCGAAGGACCAGGCCTCCGGGGTGTCGCACAATCTTTGTTGGTGGCTTACCAGAAAAAATAAGAGAGAATGCTGTAAGAGAAATATTTGAGCGTTATGGAAGAATACAAATATTAAGGTTGTCCAAGAAAAACTTTTGTCACATACGTTTTGATAGAGAGAGTTGTGTGGATGCAGCCATGGTTATATCAGGATATAGGATCAAAGTTCTGAACAAGGAAGGTGATAAGGAAGAAGACGATAATAATGCCTCAAGTGGCTGGCTTCACGTTGATTACGCTTTGAGTAGGGACGATCAGAACGAGTATGAAAGACGCCAGCGTCTAGCGCTGCGTGTTCAACAACAACAAATGCAACAGATGTCCGCACAACACGAGGCTATGAGTAACAGAAACGCCACGTACAGAAGATCTCCCTCACCCCTCAGGATACAACCCTTCTCAGCGACAGCCATCGTACAGTTGACAGAGAAAATTAAAAGCGAAGAACATTTCGCCACCACACTTCCCACACTGATATCGTGGTTGGAACGCGGGGAGTGTTCCAAAAAGAGCGCCAATCAATTCTACTCTATGATCCAAGCGACCAATTCTCACATCAGACGACTTTTCAATGAAAAAATGCAGGCTGAAGAGGAATTACAGGAATGCAAAGATAGAGTGAAAAATAATATACAGAATGTCATCGAGCAACTCGAACAGGTGGGGAAAGTGTTTAACGCAGCGACCCACCAGCGTGTCTGGGACCACTTCACTAAACCACAAAGGAAGAATATAGAAACGTGGCAGAAAATGACGCAGGAGTTCAATACATTGAAGGAGGAGTTCAGTGAGAGGTTCTTTAACGATGACTCGGAATACAATGGCACGAGCAAGGGCAGTTACGATAGCAATAATGAAGACATAAACCAGTTGAAGCGTGAAAATGAGAGTTTACAATTCCAATTGGAAGCGTACAAAAACGAGGTGGAAGTTATAAAGAACGACGCGCAAAAGGAAATGGAGAAATTTAAGGCACAATTTATAGCGCGACAAGCATTGTTAGGGGCAATGGAAAACAAACCTCCCCTACCGTCGTCAGTATCAGAGCAGCCCCCGCCTCCCCCTCCGCTGCCTGATGACGCGGATGACTCCCTGAAGACGGCCGCCGAGGTCGCGCCAGGGGAGGCCAGGCTTATAGGAGTCATGTCAGCCTTCTTACAGGTACACCCCCGGGGAGCCAGCCTGGACTACGTGGTGTCCTATGTACGAGCGTTATTCCCAAACGTAACACAAGCAATAATACACCATGTACTGCAGAAGTACGAAGACGTGTTCCAGAAAACCACCAGCGGGGTCGGAGCCAATATTGAGAACCGTTGGACCTTTGTAGCGTTTAATAATAAGACGTAG

Protein sequence:

>DPOGS204840-PA
MMMSGMFNMMMPNPMMAGGMMPSSGMEMMQSTSMEIISQPPIDITAMGTQPLAAPPLTPSMDMSMMGGVMMDPSIMGMFPNMSNEIVPEKKEIVLKHSKLVAPAPGTPQPPRRTRPPGCRTIFVGGLPEKIRENAVREIFERYGRIQILRLSKKNFCHIRFDRESCVDAAMVISGYRIKVLNKEGDKEEDDNNASSGWLHVDYALSRDDQNEYERRQRLALRVQQQQMQQMSAQHEAMSNRNATYRRSPSPLRIQPFSATAIVQLTEKIKSEEHFATTLPTLISWLERGECSKKSANQFYSMIQATNSHIRRLFNEKMQAEEELQECKDRVKNNIQNVIEQLEQVGKVFNAATHQRVWDHFTKPQRKNIETWQKMTQEFNTLKEEFSERFFNDDSEYNGTSKGSYDSNNEDINQLKRENESLQFQLEAYKNEVEVIKNDAQKEMEKFKAQFIARQALLGAMENKPPLPSSVSEQPPPPPPLPDDADDSLKTAAEVAPGEARLIGVMSAFLQVHPRGASLDYVVSYVRALFPNVTQAIIHHVLQKYEDVFQKTTSGVGANIENRWTFVAFNNKT-