Monarch geneset OGS2.0

DPOGS206398
TranscriptDPOGS206398-TA1410 bp
ProteinDPOGS206398-PA469 aa
Genomic positionDPSCF300192 + 252050-253459
RNAseq coverage499x (Rank: top 25%)
Annotation
HeliconiusHMEL0090220.091.90% 
BombyxBGIBMGA005797-TA7e-12087.87% 
DrosophilaCG11210-PA4e-12049.88% 
EBI UniRef50UniRef50_B4LJM09e-11947.60%GJ22131 n=5 Tax=Drosophila RepID=B4LJM0_DROVI
NCBI RefSeqXP_001654529.11e-12654.65%hypothetical protein AaeL_AAEL010404 [Aedes aegypti]
NCBI nr blastpgi|1571260873e-12554.65%hypothetical protein AaeL_AAEL010404 [Aedes aegypti]
NCBI nr blastxgi|1571260878e-12954.65%hypothetical protein AaeL_AAEL010404 [Aedes aegypti]
Group
Gene OntologyGO:00160203.4e-67membrane
KEGG pathwayssc:1001551732e-13 
 K00728 (POMT)maps-> O-Mannosyl glycan biosynthesis
InterPro domain[63-376] IPR0038643.4e-67Domain of unknown function DUF221
Orthology groupMCL11004 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206398-TA
ATGGTCACGGAAGCTCTTATATATACCGATACATATTACAGGAAAACTGGAAAAAGAATTACTGTGAAACCAACACTTTGTGGACCGGAAGTTGATGCTCTAGAATTCTACACTCAAGAAGAAAAACGTCTCAAAGATGAAGCTAAGAGATGTCGTGCTTTGGTCCTCAATGATCCTCTCGGTATAGCCTTTCTAACTCTTCCTTCATACCAACTCGCTGAACATGTCATCAATCATTTCAGTTTGCATAGAGGTTGGGTTCTGCAACACGCCACCAACCCATCAGATATTATATGGGAGAATTTAAGTGTCCAGCCTGGGGTCTGGTATGTAAAGGCCATAGTGGTTAATGTCTGTCTGTTTATTGTTTTGTTCTTCCTTACAACACCAGCTTTCGTTGTCAATCTCTTCAACACTCTGGTAGCTAAACCCGAGACTCTTAGTAAAATAAGTAGTTTGATATTTGAATTCCTACCTACGCTACTGCTTTGGACCATGGCGGCTGTAATGCCTGCCATTGTCGCCTTCTCTGATAAATTTCTTTCCCACTGGACAAAGTCTCAGCAAAACTATTCTATTATGGCAAAAACTGCGTCATATTTATTATTAATGACCTTAATTCTACCCTCGTTGGGACTAGCCAGTGCCGAGGCTTTCCTTGCATGGACTTTACACCACGAAAACGATACGTTGCGTTGGGATTGCGTATTTTTACCAGACAAGGGCGCCTTCTTCGTCAATTACGTCATCACCTCGGGCTTCATCGGTACAGCGCTCGAACTTATACGATTTCCGGAATTATTCTTATACGTCTGGTATCTTTTACAATCTAAATCTAAAGCAGAAAAAAGTTACGTAAAGAAGGCCATCTTATATGAGTTTCCTTTCGGAGTCCACTACGCATGGAGTCTTGCTATATTTTCTATAACAATGGTGTACAGTCTCGCATGTCCATTGATCGCACCGTTTGGTCTCATCTACTTCGTTCTGAAACACATAGGTGACAAACATAATCTGTACTTCGCGTACGGTCCGTGTGATATGAGCGGGGTCGGCGGCGGTAGAATCCACGCAACGGCCGTCAGGTTGATCAGAATATCTGTGTTGCTATTGTTGATAAACATGGCGGCCTGGGCGGGCCTGCGGGCCGGTTTCGAGGCTCGGACTATTATTTTGATAATGGCATCCGTAGTCGCGTTTGGAACATTCCTGCTGTTGAGTCCTTTCCCCAGCTGTACGCCGCCAGCGCCTCTACAGTCGGAAACGACGTTGCGCTTCCCCGAGTACGTGGCCCCGGTCCTCACTAAGCCCATAGACACTCCGCCCACATCCGCCTCCTCCACGCCGGACTACGGCGCCTCCTCGCCGGTAGTCAACCTTTCTTACAATCCAGAACCTATTAACATTTAA

Protein sequence:

>DPOGS206398-PA
MVTEALIYTDTYYRKTGKRITVKPTLCGPEVDALEFYTQEEKRLKDEAKRCRALVLNDPLGIAFLTLPSYQLAEHVINHFSLHRGWVLQHATNPSDIIWENLSVQPGVWYVKAIVVNVCLFIVLFFLTTPAFVVNLFNTLVAKPETLSKISSLIFEFLPTLLLWTMAAVMPAIVAFSDKFLSHWTKSQQNYSIMAKTASYLLLMTLILPSLGLASAEAFLAWTLHHENDTLRWDCVFLPDKGAFFVNYVITSGFIGTALELIRFPELFLYVWYLLQSKSKAEKSYVKKAILYEFPFGVHYAWSLAIFSITMVYSLACPLIAPFGLIYFVLKHIGDKHNLYFAYGPCDMSGVGGGRIHATAVRLIRISVLLLLINMAAWAGLRAGFEARTIILIMASVVAFGTFLLLSPFPSCTPPAPLQSETTLRFPEYVAPVLTKPIDTPPTSASSTPDYGASSPVVNLSYNPEPINI-