Monarch geneset OGS2.0

DPOGS205377
TranscriptDPOGS205377-TA2022 bp
ProteinDPOGS205377-PA673 aa
Genomic positionDPSCF300373 + 14178-36987
RNAseq coverage318x (Rank: top 36%)
Annotation
HeliconiusHMEL0052220.094.76% 
BombyxBGIBMGA008776-TA0.087.69% 
Drosophilauex-PE1e-15970.31% 
EBI UniRef50UniRef50_E2BD063e-17171.98%Metal transporter CNNM2 n=19 Tax=Metazoa RepID=E2BD06_HARSA
NCBI RefSeqXP_625178.25e-17573.26%PREDICTED: similar to CG40084-PC.3 [Apis mellifera]
NCBI nr blastpgi|3287836609e-17473.26%PREDICTED: metal transporter CNNM2-like [Apis mellifera]
NCBI nr blastxgi|3287836602e-16664.35%PREDICTED: metal transporter CNNM2-like [Apis mellifera]
Group
KEGG pathway 
InterPro domain[277-378] IPR0147103.4e-06RmlC-like jelly roll fold
[248-327] IPR0184904.7e-06Cyclic nucleotide-binding-like
Orthology groupMCL10515 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205377-TA
TCCGGCAACAATAATTGCCGCCGAGCTGTCAATAGAGAAAAACATGTCGCTGTCGCAGTAGTGCGTGATCCTGTTACTACGGACGTCAACGACCTGGATAAGGACGAGGTGAACATCATCTCCGGGGCGCTGGAGCTTAGGAAGAAGAAGGTCTCGGACGTGATGACGAAGTTGGAAGACGTGTTCATGCTGCCTATAACGTCTGTGCTGGACTTCGAGACGATGTCCGAGATCGTGAAGTCTGGTTTCTCCCGTATCCCGGTATACGAGGGCACCCGCACCAACATCGTGACCGTGCTCTTCATCAAGGACCTGGCGTTCGTCGACCCTGATGACAACACTCCTCTGAGAACCCTCTGCCAGTATTACCAGAACCCCTGCAACTTCGTCTTCGAGGATGTCACGCTGGATGTCATGTTCAAACAGTTCAAAGAAGGTCACAAGGGTCATATGGCGTTCGTCCACCGCATCAACAACGAGGGCGAGGGCGATCCGTTCTACGAGACCGTGGGTCTGGTGACGTTGGAGGACGTCATCGAGGAGATGATACAGGCTGAGATCGTTGATGAGACGGATGTGTTCACGGACAACCGTTCCAAGCGGCGTCGCAACCGTCCCCAGAACAAGTTGCAGGACTTCGCTGCATTCGCCGAGCGCCACGAGAACCAGAGGATCCATATATCTCCGCAGTTGACTTTGGCCACCTTCCAGTTCCTCAGCACCAGTGTGGATGCGTTCAAGCCGGATACGGTATCAGAAACGGTGCTGCGTCGTCTGCTGAGACAGGACGTGATCCACTACATCAAGATGAAAGGGAAGAGCAAAAGAGAGGCCAGCACGTACGTGTACCAACAGGGGAAAGCGGTGGACTACTTCGTACTGATCCTCGAGGGGCGAGTGGAGGTGACGGTCGGCCGAGAGAACCTCATGTTCGAGGCCGGACCCTTCACATACTTCGGAGTGCAGGCGCTCACGCAGAACGTCGGAGTCGCGGAGTCCCCCACGCCGTCCGCGATGGGCTCCCTCCAGAACATCAACATGGAGGCCATGCTGAGACACACCTTCGTCCCGGACTACTCAGTGAGAGCTGTCTCCGACCTGTACTATCTGGCTGTTAAGAGGTCCCTCTATTTGGCGGCGAAGAGGGCCACACTTATGGAAAAGGGGGCCCTGTCGAAAGGCGCCACTAATGAACAGTTCGACACTGAAGTTGATAAGCGTCTGACAGGACCTTCCGCCCACCAGCTCCGGATCGCTGACGATGAAGACTTGGAGCAGGGTTTCGAGGTGTCGCGGAACAGTTCGAAGAAGTCGCTGGCTATTGTGGACAGCGCGAAGAGTTTGCCCACCGTGGAACAGCCCACGCCGTCAACACTGTCGACCAGGAGCGAGGGGAATATGAAGAAGAAGAATTACTACGCCAAGACATACAAGATCATCATGAACAAGAAGGAGGACGACAGGAGCGGAGATCTGATGTACCTGCCCAGCACCTCCAGGGGCTGGAGGCGAGAGGACACTGAAGAATCCATGAAGGATTCTGCAATGAAAAAAACACAATTCAAGACAGAGAAGTCTCTGCTGACTTTCCACGAACACAAAGACAGTGAGCGTCAGCGAGCTGATTCCAAAGAAAATGATGGCAAGGAAACGGGAAGAGACGGAGGAGGAGCCGACGATGACATAAGAGAGATATACAGCGACGGGATAGTCTCGAATACGAGTCTCGCAGACAAAATGAAACCCACGAAGGAGAAAATCCCGATCGAAGTTTACGAGGATCTTGGAAACCGCATCAGGGACTCGGAACAGCTGGTGCTGGCTAAACTGGACGAACTTCTTCAGTCCGTTGACGAAGACATGAGTATCAGTGGGGAAAACAAGACTCCATCCAGACAGGTGTCCCCTAACCCGGCGGTGCCGCCTTCACCGGCGACGCGCGCCTCCTTCTCCCGGATGTCCCCAGACAGGAACGGGGATATATTCGGAAGAGATGAACAAGAGAAGCTGCTGAAACACTGA

Protein sequence:

>DPOGS205377-PA
SGNNNCRRAVNREKHVAVAVVRDPVTTDVNDLDKDEVNIISGALELRKKKVSDVMTKLEDVFMLPITSVLDFETMSEIVKSGFSRIPVYEGTRTNIVTVLFIKDLAFVDPDDNTPLRTLCQYYQNPCNFVFEDVTLDVMFKQFKEGHKGHMAFVHRINNEGEGDPFYETVGLVTLEDVIEEMIQAEIVDETDVFTDNRSKRRRNRPQNKLQDFAAFAERHENQRIHISPQLTLATFQFLSTSVDAFKPDTVSETVLRRLLRQDVIHYIKMKGKSKREASTYVYQQGKAVDYFVLILEGRVEVTVGRENLMFEAGPFTYFGVQALTQNVGVAESPTPSAMGSLQNINMEAMLRHTFVPDYSVRAVSDLYYLAVKRSLYLAAKRATLMEKGALSKGATNEQFDTEVDKRLTGPSAHQLRIADDEDLEQGFEVSRNSSKKSLAIVDSAKSLPTVEQPTPSTLSTRSEGNMKKKNYYAKTYKIIMNKKEDDRSGDLMYLPSTSRGWRREDTEESMKDSAMKKTQFKTEKSLLTFHEHKDSERQRADSKENDGKETGRDGGGADDDIREIYSDGIVSNTSLADKMKPTKEKIPIEVYEDLGNRIRDSEQLVLAKLDELLQSVDEDMSISGENKTPSRQVSPNPAVPPSPATRASFSRMSPDRNGDIFGRDEQEKLLKH-