Monarch geneset OGS2.0

DPOGS207287
TranscriptDPOGS207287-TA1377 bp
ProteinDPOGS207287-PA458 aa
Genomic positionDPSCF300008 + 280097-290942
RNAseq coverage676x (Rank: top 19%)
Annotation
HeliconiusHMEL0163031e-10672.56% 
BombyxBGIBMGA012019-TA0.073.95% 
DrosophilaCG7966-PA1e-16660.50% 
EBI UniRef50UniRef50_Q9VFZ42e-16460.50%CG7966 n=16 Tax=Arthropoda RepID=Q9VFZ4_DROME
NCBI RefSeqXP_970689.12e-17161.05%PREDICTED: similar to selenium-binding protein [Tribolium castaneum]
NCBI nr blastpgi|910920644e-17061.05%PREDICTED: similar to selenium-binding protein [Tribolium castaneum]
NCBI nr blastxgi|910920642e-17261.05%PREDICTED: similar to selenium-binding protein [Tribolium castaneum]
Group
Gene OntologyGO:00084302.6e-273selenium binding
GO:00055156.6e-06protein binding
KEGG pathway 
InterPro domain[3-455] IPR0088262.6e-273Selenium-binding protein
[294-388] IPR0159436.6e-06WD40/YVTN repeat-like-containing domain
Orthology groupMCL12183 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207287-TA
ATGGCGTGTTGTAAGGGTCCGGGATACGCAACCCCACTAGACGCATTCCGTAACGGCCCCAGGGAGGAGTTACTTTACGTTGTGTGTGTACAACCAGACCTCACTAAACAGGACTACCTTGCTACTGTGGACGTGGATCCTAAATCACCCACTTACAGCCAGGTAATCCATCGCACGTACACTGGCAGCGTTGGGGACGAATTACATCACAGTGGATGGAATGTATGTTCAAGTTGCTATGACAATCCAGATCTGAAGAGGAATCTCCTTATTCTGCCAGGACTGCATTCCTGCAAAGTTTTTGCTGTTGATGTTGGCACCGATCCCCGAAAACCAGAACTATATAAGGTGATTGACGGCTCTGAGATGAGGTCATTCAATTGTTCATTTCCTCACACGACACACTGTCTAGCAAGCGGGGATATTATGATTTCAACGATGGGAGATAAAAACGAGGACGGAAAGGGGGATTTTGTGCTCATTGACTCCAAAACTTTGAAAGTGACAGGGACATGGACGAGAGGTGAAAAAATTGCTAAATTCGGTTATGACTTCTGGTATCAACCCTACCACGATGTTATGATTTCTTCTGAATGGGGATCCCCTAAACACTTTAAATCTGGTTTTCATCCGGGTGACATTCCCAACTCGGAGAGATATGGAACTTCTCTTAATGTTTACAAATGGTCTACCCGGGTTCTAGAACAAGTCATAGATTTGGGTAACGAAGGGTGTGCGCCCTTAGAGATCAGATTCCTTCATGATCCTAAATCCGAAGTGGGATTTGTAGGGTGCGCCGTCTATGCGAATGTGTACAGATTTTACAAATCAAATGAAGGAAAATGGAAAGCTGACAAAGTTATTGATATACCAGCAAAAAAAGTTATCAAAGATGGAAAAGAAACCTTGATAAATGGATTAATATCCGATATTTTGTTATCCTTGGATGACAAATATTTATACATTTCCTGTTGGCTCCATGGTGAGGTTAGACAGTACGACGTTTCCGATCCAAAGAAACCAAAACTTACAGGAAAAATACTTTTGGGTGGAGAAATTGAAGTGCCGCAACTTGTTACAGTAAAAGGTAAAAAATTATACGGCGGCCCTCAGATGTTGCAACTCTCATTAGATGGGAAACGTCTTTATGTATCTTCATCACTTTACTCCCCTTGGGATAAGCAGTTCTATCCTAAAATGGTTGATCAGGGAGGTTGGATAGTCAAGTTGGACGTCGACACGGTTAACGGAGGCATAAAATTGGATCCTGATTTTCTTGTAGACTTTGGGAAGGAACCGAACGGACCGGTTATACCGCATGAAATGAGGTATCCTGGAGGAGATTGTACTTCTGACATTTGGCTGGCTGAAAAGTAA

Protein sequence:

>DPOGS207287-PA
MACCKGPGYATPLDAFRNGPREELLYVVCVQPDLTKQDYLATVDVDPKSPTYSQVIHRTYTGSVGDELHHSGWNVCSSCYDNPDLKRNLLILPGLHSCKVFAVDVGTDPRKPELYKVIDGSEMRSFNCSFPHTTHCLASGDIMISTMGDKNEDGKGDFVLIDSKTLKVTGTWTRGEKIAKFGYDFWYQPYHDVMISSEWGSPKHFKSGFHPGDIPNSERYGTSLNVYKWSTRVLEQVIDLGNEGCAPLEIRFLHDPKSEVGFVGCAVYANVYRFYKSNEGKWKADKVIDIPAKKVIKDGKETLINGLISDILLSLDDKYLYISCWLHGEVRQYDVSDPKKPKLTGKILLGGEIEVPQLVTVKGKKLYGGPQMLQLSLDGKRLYVSSSLYSPWDKQFYPKMVDQGGWIVKLDVDTVNGGIKLDPDFLVDFGKEPNGPVIPHEMRYPGGDCTSDIWLAEK-