Monarch geneset OGS2.0

DPOGS210739
Transcript: DPOGS210739-TA (1251 bp)
Protein: DPOGS210739-PA (416 aa)
Genomic position: DPSCF300013 + 274399-279272
RNAseq coverage: 145x (Rank: top 54%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL004605        0.0      90.60%
Bombyx          BGIBMGA006271-TA  0.0      80.14%
Drosophila      CG42322-PJ        2e-103   43.64%
EBI UniRef50    UniRef50_B7Z0N3   3e-101   43.64%    CG42322, isoform I n=9 Tax=melanogaster subgroup RepID=B7Z0N3_DROME
NCBI RefSeq     XP_002424324.1    2e-109   52.80%    conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp  gi|380011066      9e-109   49.08%    PREDICTED: solute carrier family 35 member F3-like [Apis florea]
NCBI nr blastx  gi|242006988      3e-109   52.93%    conserved hypothetical protein [Pediculus humanus corporis]
Group
KEGG pathway:
Orthology group: MCL11652 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210739-TA
ATGGCTAGAGATGGGGACGTGCCTACTATTTTCAATCCTAAGAGAGTACGCACGCCGTCTGTTATAATAACTAGCGAAGAAGTGGAAAGCGAGCAGGGTGGTGATGTCGAGCGCGAACGTCCAAACGGGCCTCCGACGCCGATAAGTCCTAGGGCGCCGCTAACTCCGCAGACTTCGGTGTCTTCAGCCCCACCTTTGACCACCCAACCGTCGCTGCAGCAGGCCTACAGCGAGTCTGCAGCTAGCAGTCAAGATGAGGAGCGGCAATTACGAGATAAAAAGTGCCCGCGGTCATGCTGTTCCAAAGTAGCCAAAAAGATTTACTATGGCCTCTGTGTCACAACAACCATTGTTGTTTCCTGGGTGACTGTGACCCAATCCATAAAATACATGTATCTTCAGAAGTACTCCCATGCGGACATCTTTTTGGGTGGTACAGAATCGCCAATTAATAATAATTATACATCTAGAATTCCAACACATCACCATTTCAATGCCCCGTTCTTCACTGCATGGTTCTGTACAAACTGGCTTATATTTTTCTACCCTCTTTACTTAATAATTATGCTTATTCACAACAAATGTAAATCAGCCAATACCATTGTGGGTGATGCTTTCACGGACTTTAAGGAAAAGGGATTCACTTTTGCCCGTTTTCTGAGTCGCTGCGGTTTGTTCTGTCTGTTGTGGGTGGTCACTATATACATGTACACATATGCCTTGAAGATACTTCTATCAACGGATGTGGTGGCACTGTTTGCTACTAATGTATCCTGCGTGTATCTCCTCTCGTGGGTGATACTTCATGAGCAGTTTGTTGGTGTGAGGATAGTGGCGGCTATCCTGTGCGACACGGGTATAGCCCTGCTAGCATATATGGATGGTATCACCGGTAGTTCAACGTTGGGTGGTGTTGTGCTCGCCGCTTGTGCTGCTGCTGGGTTTGCTATATTCAAGGTTTTGTTCCGCAAGGTGATGGGTGAAGTGAGCAGCGGCCAGCGAGCTCTGTTCTTCTCGGTGCTGGGCGTGGTGAACGCGACCCTGCTGTGGCCGGTGAGCCTGGCCCTGTGTCTGACCGGCGCCGAACAGCTGCCGGCCTTGAGGATGCCCTGGGTGCCGCTGTTGTTGGCCAGTGTTGCACTGCTCGTTTTCCACCTCGTGTTCCAGTTCGGCAACCTCATGACCTACAACATCTTCGTGTCTCTGGGGCTCATAGCGGCCGTGCCTGTGTCTGCTGGTGATTACAACTAA

Protein sequence:

>DPOGS210739-PA
MARDGDVPTIFNPKRVRTPSVIITSEEVESEQGGDVERERPNGPPTPISPRAPLTPQTSVSSAPPLTTQPSLQQAYSESAASSQDEERQLRDKKCPRSCCSKVAKKIYYGLCVTTTIVVSWVTVTQSIKYMYLQKYSHADIFLGGTESPINNNYTSRIPTHHHFNAPFFTAWFCTNWLIFFYPLYLIIMLIHNKCKSANTIVGDAFTDFKEKGFTFARFLSRCGLFCLLWVVTIYMYTYALKILLSTDVVALFATNVSCVYLLSWVILHEQFVGVRIVAAILCDTGIALLAYMDGITGSSTLGGVVLAACAAAGFAIFKVLFRKVMGEVSSGQRALFFSVLGVVNATLLWPVSLALCLTGAEQLPALRMPWVPLLLASVALLVFHLVFQFGNLMTYNIFVSLGLIAAVPVSAGDYN-
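
The transcript and protein lengths listed above are consistent with each other: 1251 bp is 417 codons, which is 416 amino acids plus one stop codon (shown as "-" at the end of the protein sequence). The following is a minimal Python sketch of that check, assuming the standard genetic code; the function name `translate` and the short excerpts are illustrative and are not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the "-" used
    at the end of the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 bases and last 12 bases of DPOGS210739-TA, copied from the record.
first_30 = "ATGGCTAGAGATGGGGACGTGCCTACTATT"
last_12 = "GATTACAACTAA"

print(translate(first_30))  # MARDGDVPTI
print(translate(last_12))   # DYN-

# Length check: 1251 bp -> 417 codons -> 416 residues plus one stop codon.
transcript_bp = 1251
print(transcript_bp // 3 - 1)  # 416
```

Passing the full nucleotide sequence to `translate` in place of the excerpts should reproduce the complete protein sequence, provided the standard-code assumption holds.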