Monarch geneset OGS2.0

DPOGS214978
TranscriptDPOGS214978-TA1758 bp
ProteinDPOGS214978-PA585 aa
Genomic positionDPSCF300256 - 320256-322208
RNAseq coverage1000x (Rank: top 13%)
Annotation
HeliconiusHMEL0148140.074.13% 
BombyxBGIBMGA002352-TA0.075.62% 
Drosophilaneur-PA2e-7367.26% 
EBI UniRef50UniRef50_D6WC616e-16349.30%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WC61_TRICA
NCBI RefSeqXP_972157.11e-16349.30%PREDICTED: similar to neuralized [Tribolium castaneum]
NCBI nr blastpgi|910780842e-16249.30%PREDICTED: similar to neuralized [Tribolium castaneum]
NCBI nr blastxgi|910780842e-15950.74%PREDICTED: similar to neuralized [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[52-173] IPR0065738e-55NEUZ
[523-579] IPR0130832.4e-10Zinc finger, RING/FYVE/PHD-type
[532-572] IPR0189579.4e-06Zinc finger, C3HC4 RING-type
Orthology groupMCL11614 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214978-TA
ATGGTCGAGGGTTTGGGCAGAAGAACTTTTATACTGACTCTTGCACCGCAGTCGGGCGAGCGTCGCTCTGTATGTAGAATTCATTCGTTAATGTGTGTGTGTCTGTTTGTTGCAGCTCCTCGCTCCTCCTGTTCGGGCGGAGCGCCCAACAACCTTCCCCCGCTGAGTTTCCACTCCGTCCACGGTGAGAACGTCCGCGTGTCCCGGGATGGCAGCACGGCCCGCCGGGTGGAGTCCTTCTGCAAGGGAGTCGCCTTCAGCGCCAGGCCCGTACGAGTCAATGAGAAGGTGTGCATCCGTTTCGTGGAGATCTCGAACAGCTGGAGCGGGGTGATACGTTTCGGGTTCACGGGTCACGACCCCGCGACCCTCGCGCACGCGCTGCCCAAGTACGCCTGCCCCGACCTCACCAACAAGCCTGGCAACTGGGCCAAGGCTCTGGGGGAGAGGTTCTGTGAGAAGGACAACGTGCTGTACTACTACGTGAACAGCGCCGGGGACGTGCTGTTCGGAGTCAACGGAGAGGACAAGGGACTGTTCTTCTCCGGAGTCGACACCCGGAGTCCGCTGTGGGCCTTGATAGACGTGTACGGCAACTGTACGGCCGTGCAGTTCGCGGACCCGCGGGCGCCCGGCCAGCCCAGGAGGTGCGCCCACACCCCCGTGGACGATGACAGCCTGGTCGGCGGCATGAGGGCGCTCGCCGTGGACGAGGCGCTCCCCCCGCCCAGACACCAGAACCCGCTGGTGCCGCTCACTCTGCACAGGACCAGAGGCAGGAACGTGCACTACGTCAACGACCGGGGCATCGCCGCCCGCGCCGAGGCCGAGTTCTGTCAGGGTTACGTGTTCACGGCGCGGCCCATGAGGCCCGGACAGACCATCGTCGTCCAGATCTTGGCCACGGAGGCGGCTTACGCCGGCAGCCTGGCGATAGGCCTCACGTCCTGCGACCCCGCGCTCCTCACGCAGGACGCGCTGCCGGACGACGCCGACCAGCTGCTGGACCGCCCGGAGTACTGGGTGCTAAAGAGAGACGCCGCTGCCGGCCTGCGCCGACACGACGAGCTGGCCGTCACTCTCTCCGGCGACGGGGAGGTGCGCCTGGCCAGGAACGGCGCGCCGCCCGCCACCGTCATGCACGTGGACCACACGCTGCGCCTGTGGGCCTTCGTCGACATCTACGGCGCCACGCAGAGAGTACGCCTGCTCAGCACGGCCGCCGCGCCCTCCCCGTCCGCCCAGGGCTCCGCGCCGGCCGTGCGCGTGCTGGGCGCGGCGCCCGGCGGGACCGTGCTGGTGGTCAGCTTGCCGCCGCAGCAGCAGCAACACCAGCAGCAGCCGCTAACCCTCGCCAGCGCTCACTACATAGAGCCGCTCCAGTCGTCCCAGCTGTGTGTGTCGGGTCTGCAGCAGGCGGCCGCCCCCTACCGCCGCGACCACTGCGCCTCCAGCTCTCAAAACAGCTGCAACGAGCAGCTGACCCTCCCCGGCGGCGCCGCAGAGCGTCTCCAACCCCTCCAGCAGCTGCAGCCGCTCCCGTCCACCAGCCGCCAGTACTCCATGCTGGAGGGCAACCTGACCGGGGCGGAATGCACGATATGCTTCGAGAACCCCGTGGACAGCGTGCTCTACATGTGCGGTCACATGTGCATGTGTTACCGCTGCGCCGTCCAGCAGTGGCGGGGGAAGGGCGGCGGCCAGTGTCCGCTGTGCCGGGCGCAGATAAAGGACGTCATCCGCACCTACAAGTCCTGA

Protein sequence:

>DPOGS214978-PA
MVEGLGRRTFILTLAPQSGERRSVCRIHSLMCVCLFVAAPRSSCSGGAPNNLPPLSFHSVHGENVRVSRDGSTARRVESFCKGVAFSARPVRVNEKVCIRFVEISNSWSGVIRFGFTGHDPATLAHALPKYACPDLTNKPGNWAKALGERFCEKDNVLYYYVNSAGDVLFGVNGEDKGLFFSGVDTRSPLWALIDVYGNCTAVQFADPRAPGQPRRCAHTPVDDDSLVGGMRALAVDEALPPPRHQNPLVPLTLHRTRGRNVHYVNDRGIAARAEAEFCQGYVFTARPMRPGQTIVVQILATEAAYAGSLAIGLTSCDPALLTQDALPDDADQLLDRPEYWVLKRDAAAGLRRHDELAVTLSGDGEVRLARNGAPPATVMHVDHTLRLWAFVDIYGATQRVRLLSTAAAPSPSAQGSAPAVRVLGAAPGGTVLVVSLPPQQQQHQQQPLTLASAHYIEPLQSSQLCVSGLQQAAAPYRRDHCASSSQNSCNEQLTLPGGAAERLQPLQQLQPLPSTSRQYSMLEGNLTGAECTICFENPVDSVLYMCGHMCMCYRCAVQQWRGKGGGQCPLCRAQIKDVIRTYKS-