Monarch geneset OGS2.0

DPOGS207785
Transcript: DPOGS207785-TA, 1296 bp
Protein: DPOGS207785-PA, 431 aa
Genomic position: DPSCF300042 + 148636-152370
RNAseq coverage: 417x (Rank: top 29%)

Annotation
Heliconius: HMEL017580, E-value 0.0, 91.18%
Bombyx: BGIBMGA005479-TA, E-value 8e-171, 88.89%
Drosophila: mol-PA, E-value 4e-111, 53.09%
EBI UniRef50: UniRef50_UPI0001793292, E-value 3e-102, 49.11%, UPI0001793292 related cluster n=1 Tax=unknown RepID=UPI0001793292
NCBI RefSeq: XP_001658450.1, E-value 8e-115, 55.50%, hypothetical protein AaeL_AAEL007562 [Aedes aegypti]
NCBI nr blastp: gi|157116383, E-value 2e-113, 55.50%, hypothetical protein AaeL_AAEL007562 [Aedes aegypti]
NCBI nr blastx: gi|157116383, E-value 1e-113, 53.69%, hypothetical protein AaeL_AAEL007562 [Aedes aegypti]

Group:
Gene Ontology:
  GO:0016021, E-value 3.3e-83, integral to membrane
  GO:0005789, E-value 3.3e-83, endoplasmic reticulum membrane
  GO:0015031, E-value 3.3e-83, protein transport
KEGG pathway:
InterPro domain: [17-298] IPR018469, E-value 3.3e-83, Dual oxidase maturation factor
Orthology group: MCL11249, Multiple-copy universal gene

Nucleotide sequence:

>DPOGS207785-TA
ATGAAGGGGTGGTTCGACGGTTTCCGCGAGGAAGGCGGCCCGACCCTGTACGCTTATCACAACCGCACCGCCGTCGCCGCCGACGTGCCCGCGGTCGCCCTGCTTGTAGCCGCCCTTACGTTATACCTCGCGTTCCTTACTATATTCCCTGGTATACGGAAGGAGCGATTTTCCACGTTCACCATCGTTACACTTAGCCTCTTCGTTGGTACTGTGATTTTAGTATGTAAGCACGGATCCTCGTGGCACGTGGCAGGGGCGCGTGTAGCCCGCGCCGCTTACCGCGCCTTCTCAGCTGAGCGACTAGACTGCTGGCTCGCCGTCCACGTCGGCTTGGGACACGTCAATGTCACTCTTAACGCGTTATCATGGGGCAACGTCTCGACAGGCGATCCAGGAATCGATTACAACGAACAATTTCGATGGGAAGAGGCTGGTGCTATCCAGGAATGGTATCGAGCGGGTTTATTACGAGGTTTACCATACCCCGTACTGTCTGTAGCTGAACACTTTGCAGCGGAACACGAGGGATTCGAGTGGGGTGCTAAGTATAGAGCAGCTGGTTATACTACCGCCACGCTTCTTTGGACGGCGTTGGCACTGTGGTTATTAATGAACCTCCTGCTGGTGGTGGTACCACGTTATGGCGCGTACGCAATGGCCTCTCTGGGAGTAACGCTGTGTGCTGCTGCCGGAGGATACTGGGCATCCTTACCACACGCGCCGCTAGTTGTGCGCTTGGATGGAGCCATGCTATTCTTCTCGATGGGCTGGTGTTTCTGGCTAGTCCTGATTGCTGGTGCCATATGTCTGGTAGTGGGTCTACTTATAGCAGCGCTAGACTTGGTGTGGCCTCACAGATTCTCGACCGTACTTGAAGTAGATTACGACACTCCTTATGACAGACACGTCCTCATAGTGGACAGTAGACAAAGAGCTCGTCCACAAACACAGAGCTCTTTGCCATCCAGAATACTGAGGCGATTATCTTCTAAGACCCGAGATCCTGAACGTCATGTGTCAATGTCAGTTGTAAATGAAAGTTCTGGTCGGGACAACCCTGCTTACCAACACGAACAGAGAAAACCTAATTCGCCGTGGAGGTATCCACTGTTTAGACGGCAGATTAATAGAGTCGACTCCGCCTCGAGTATGGGATCAAGCATAAATGGTGCCAGTTCCGGAATAAAGATGTCCCCACTAGTAGGTGGACCCTCATCGAACACACTATCGACGACCGCACAGAGATACCGTCCACAAATAACAGCAGCTGAAAGAGTGAAAGACATGTGGTGA

Protein sequence:

>DPOGS207785-PA
MKGWFDGFREEGGPTLYAYHNRTAVAADVPAVALLVAALTLYLAFLTIFPGIRKERFSTFTIVTLSLFVGTVILVCKHGSSWHVAGARVARAAYRAFSAERLDCWLAVHVGLGHVNVTLNALSWGNVSTGDPGIDYNEQFRWEEAGAIQEWYRAGLLRGLPYPVLSVAEHFAAEHEGFEWGAKYRAAGYTTATLLWTALALWLLMNLLLVVVPRYGAYAMASLGVTLCAAAGGYWASLPHAPLVVRLDGAMLFFSMGWCFWLVLIAGAICLVVGLLIAALDLVWPHRFSTVLEVDYDTPYDRHVLIVDSRQRARPQTQSSLPSRILRRLSSKTRDPERHVSMSVVNESSGRDNPAYQHEQRKPNSPWRYPLFRRQINRVDSASSMGSSINGASSGIKMSPLVGGPSSNTLSTTAQRYRPQITAAERVKDMW-
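
Consistency check (sketch): the transcript and protein records above can be checked against each other. The following minimal Python sketch assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon; neither assumption is stated in the record itself. The names translate, CDS_START and CDS_END are illustrative only, and the two sequence excerpts are copied from the first 30 and last 12 nucleotides of DPOGS207785-TA.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record uses the standard code (not stated in the source).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as listed in the record: 431 residues plus one stop codon
# should account for all 1296 bp.
TRANSCRIPT_BP = 1296
PROTEIN_AA = 431
assert TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# Excerpts copied from the nucleotide record (hypothetical variable names).
CDS_START = "ATGAAGGGGTGGTTCGACGGTTTCCGCGAG"  # first 30 nt
CDS_END = "GACATGTGGTGA"                     # last 12 nt

print(translate(CDS_START))  # MKGWFDGFRE
print(translate(CDS_END))    # DMW*
```

The translated excerpts agree with the start (MKGWFDGFRE) and end (DMW followed by the stop marker) of DPOGS207785-PA. Passing the full nucleotide sequence to translate in place of the excerpts would extend the same check to the whole record.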