Monarch geneset OGS2.0

DPOGS205145
TranscriptDPOGS205145-TA1083 bp
ProteinDPOGS205145-PA360 aa
Genomic positionDPSCF300246 + 67796-76821
RNAseq coverage492x (Rank: top 25%)
Annotation
HeliconiusHMEL0026037e-12991.53% 
BombyxBGIBMGA008139-TA6e-11996.23% 
Drosophilasm-PK4e-8158.24% 
EBI UniRef50UniRef50_D6WHW64e-8958.93%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WHW6_TRICA
NCBI RefSeqXP_001809748.11e-8958.93%PREDICTED: similar to smooth CG9218-PG [Tribolium castaneum]
NCBI nr blastpgi|2700035992e-8858.93%hypothetical protein TcasGA2_TC002855 [Tribolium castaneum]
NCBI nr blastxgi|2700035992e-8658.93%hypothetical protein TcasGA2_TC002855 [Tribolium castaneum]
Group
Gene OntologyGO:00001666.8e-10nucleotide binding
GO:00036766e-06nucleic acid binding
KEGG pathway 
InterPro domain[111-213] IPR0126776.8e-10Nucleotide-binding, alpha-beta plait
[111-180] IPR0005046e-06RNA recognition motif domain
Orthology groupMCL13789 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205145-TA
ATGTCAAGTATAATGGCGGTGTCGTCAGGCATTCCAGTAAAAGGCTATTATTCCACCACATCGACCTCGCTACCGAGGTCATGCAGCGAGGGGACATGCAGTGAACACTGCCAGAACACCTCCGACCAGAACGATATGGATTTCGGTAGTGGTGGACCACCTCCGCCGGGGATGATGAAAGATCGACACGGTCCTCCCCCTCCACATCATCCGCGCGATGGTCGCGGCTACGGTCCAGAGGGTTACGGGGGCGAGGGCTTCGTCCCGCCTCCGCCCATGCACCACCCGATGCCGCCGCAGAATCAGGGAGGTCCTCCACAGCAGGGCGCTGTGATGATGGTGTACGGTCTCGATCCAGTCACTATCAACGCTGACCGACTCTTCAACCTGTGCTGTCTCTATGGAAACGTCGTTAGAATAAAATTCCTAAAAACAAAGGAAGGCACTGCGATGGTTCAAATGGGTGATGGCGTGTCCGTTGAGCGTTGTGTCCAAAACTTGAACAACGTTACCGTCGGGGACTACACACTTACGCTCGCGTTTTCTAAACAGGCGTACCTATCAGAGGTGATGAACCCCTACCCTCTTCCTGACAAGAGTCCCTCGTTCAAGGATTACGTCGGCAACAAGAACAACAGGTTCCTGACACCTGCCTCCATACACAAGAACAGAATACAACCTCCGTCCAAGGTGGTACATTTCTTCAACACCCCACCGGATGTGTCAGACGAGCAGCTGTTGCAGGTGTTCCACGACTACGGGGTCACGCCGCCGCATACAGTGTCCAAATTCCCGCTCAAGAGTGAGAGGTCCTCGTCGGGGCTGATGGAGTTCCAGAACATCTCTCAGTCTGTGATGGCCATCATGGCCTGCAACCACGCGACCATACAGCATCCAGGTGCTAAGTTCCCGTTCGTTATGAAGCTGTGCTTCTCGTCGTCACGTCAGATCGGTCAGAAGGGGCAGGGGGACAGGCAGGGGAAGGGTCAGACCGGACAGGCGGGTCAGGCGGGTCAGACTGGCCAGGCGGGGCAGGCGGGTCAGACCGGTCAAAACAACGGCATCGACGATGACTACTACTAA

Protein sequence:

>DPOGS205145-PA
MSSIMAVSSGIPVKGYYSTTSTSLPRSCSEGTCSEHCQNTSDQNDMDFGSGGPPPPGMMKDRHGPPPPHHPRDGRGYGPEGYGGEGFVPPPPMHHPMPPQNQGGPPQQGAVMMVYGLDPVTINADRLFNLCCLYGNVVRIKFLKTKEGTAMVQMGDGVSVERCVQNLNNVTVGDYTLTLAFSKQAYLSEVMNPYPLPDKSPSFKDYVGNKNNRFLTPASIHKNRIQPPSKVVHFFNTPPDVSDEQLLQVFHDYGVTPPHTVSKFPLKSERSSSGLMEFQNISQSVMAIMACNHATIQHPGAKFPFVMKLCFSSSRQIGQKGQGDRQGKGQTGQAGQAGQTGQAGQAGQTGQNNGIDDDYY-