Monarch geneset OGS2.0

DPOGS209857
TranscriptDPOGS209857-TA1083 bp
ProteinDPOGS209857-PA360 aa
Genomic positionDPSCF300451 + 6049-17216
RNAseq coverage1397x (Rank: top 9%)
Annotation
HeliconiusHMEL0169692e-12368.67% 
BombyxBGIBMGA002345-TA1e-11867.27% 
DrosophilaCG7816-PA3e-10055.80% 
EBI UniRef50UniRef50_Q9VAF05e-9855.80%Uncharacterized protein CG7816 n=28 Tax=Eumetazoa RepID=Y816_DROME
NCBI RefSeqXP_001955492.14e-9956.08%GF16226 [Drosophila ananassae]
NCBI nr blastpgi|1947460408e-9856.08%GF16226 [Drosophila ananassae]
NCBI nr blastxgi|1953413394e-9555.96%GM12209 [Drosophila sechellia]
Group
Gene OntologyGO:00160208.5e-52membrane
GO:00550858.5e-52transmembrane transport
GO:00468738.5e-52metal ion transmembrane transporter activity
GO:00300018.5e-52metal ion transport
KEGG pathway 
InterPro domain[20-356] IPR0036898.5e-52Zinc/iron permease
Orthology groupMCL14467 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209857-TA
ATGGCGGGATTTTTACCTGAATATTTTTATACTTTTGTGGAGGAATTAGACGAACATCCCTGGTTATTTTCGGCTCTCGCATCAGTTCTGGTCGGTTTGAGTGGAATCTTGCCTTTGCTAATCATACCCATCGATGAAACTGCCAACTTTAAAGACGGCGCTGGCGCTGCGACTCTCCGTATCCTATTGAGTTTTGCTGTGGGCGGTCTGCTGGGAGACGTGTTCCTTCACCTGCTACCGGAGGCATGGCATCACGACATGATGAGCAGTAAAGGGGGTGAGGTGTCAATGAAGTGTGGGATATGGGTTCTGGTGGGAATGCTGGTGTTCGTGATAGTCGAGAAGCTGTTCGCGAGCTCGGAGGAGGAGGATCCCAAGGTCGAGGCCGTGGAGATAGAAGACATCGAGATCCTGTTACGAGCACAGAAGAGACACACGGAAAACGGGTCGTTGACCGACAAACAGATGATGGAGACCTGTGTCTTCAATAATAATACAAAAGGTGATGCAGTGTGCTGCAGCGCTGTATCCACCTGCGGGTACAAGGGCTCCAGGTGGATGGGCAGGTGTCTGCTCAGGGAGGCTCGGGAGAAGAGTTTGATGAACAAACAGAAGAACGACAAGAAAGATGTGGCGGGTTACCTGAACCTGATGGCCAACTCCATAGACAACTTCACCCACGGCCTGGCGGTGGGCGGTTCCTTCCTCGTGGGCTTCAGGGTGGGGCTGCTCACTACATTCGCTATACTCGTCCATGAAATCCCTCACGAGGTCGGTGACTTCGCGATCCTCCTCAAGAGCGGCTTCTCCAGGTGGGAGGCGGCGAAAGCTCAGATCGCGACGGCTGCGGCCGGGCTTTTGGGGGCGATGACGGCGGTCGTGTTCAGCGGAGCCAGCAACGCTATCGAGGCCCGCACGTCATGGATCGCTCCGTTCACGGCGGGCGGGTTCCTGCACATCGCCTTGGTGACGGTGCTGCCCGAGCTGCTGCGGGAGCACGACCGCCTCGAGTCCGCCAAGCACCTGGCCGCCTTGCTGGCGGGCCTGGCGCTCATGGCCGCACTCAACCATTACTGCGGATGA

Protein sequence:

>DPOGS209857-PA
MAGFLPEYFYTFVEELDEHPWLFSALASVLVGLSGILPLLIIPIDETANFKDGAGAATLRILLSFAVGGLLGDVFLHLLPEAWHHDMMSSKGGEVSMKCGIWVLVGMLVFVIVEKLFASSEEEDPKVEAVEIEDIEILLRAQKRHTENGSLTDKQMMETCVFNNNTKGDAVCCSAVSTCGYKGSRWMGRCLLREAREKSLMNKQKNDKKDVAGYLNLMANSIDNFTHGLAVGGSFLVGFRVGLLTTFAILVHEIPHEVGDFAILLKSGFSRWEAAKAQIATAAAGLLGAMTAVVFSGASNAIEARTSWIAPFTAGGFLHIALVTVLPELLREHDRLESAKHLAALLAGLALMAALNHYCG-