Monarch geneset OGS2.0

DPOGS208475
Transcript: DPOGS208475-TA (1743 bp)
Protein: DPOGS208475-PA (580 aa)
Genomic position: DPSCF300064 - 1494178-1496714
RNAseq coverage: 474x (Rank: top 26%)
Annotation
Heliconius: HMEL004305  0.0  72.77%
Bombyx: BGIBMGA010658-TA  0.0  83.82%
Drosophila: CG3702-PA  0.0  57.94%
EBI UniRef50: UniRef50_G6DT57  0.0  100.00%  Cleft lip and palate transmembrane protein 1 n=2 Tax=Endopterygota RepID=G6DT57_DANPL
NCBI RefSeq: XP_001844023.1  0.0  58.47%  cleft lip and palate transmembrane protein 1 [Culex quinquefasciatus]
NCBI nr blastp: gi|307170575  0.0  59.29%  Cleft lip and palate transmembrane protein 1-like protein [Camponotus floridanus]
NCBI nr blastx: gi|322792762  0.0  61.23%  hypothetical protein SINV_00783 [Solenopsis invicta]
Group
KEGG pathway 
InterPro domain: [22-559]  IPR008429  7e-253  Cleft lip and palate transmembrane 1
Orthology group: MCL13974  Single-copy universal gene

Nucleotide sequence:

>DPOGS208475-TA
ATGGTAAACGAAGAAAATTCTTCCGGCGATGCGTCGGATAAATTAGTACCTAGTGATAGCGCTCCGAGACAAAACGAGATGTCTAATGAAAATGTCGATATTACTCCACAAAGTGATGAGCAGAGGCAGAGGCAGCCAACAAAAATGGAATCATTTCTTGCTATAACAAAATCTTTGATTATTAGAGGTTTAGTTGTTTACCTTATAACATCAATGTTTCGTCAGCCCTCACCACCTAAGGCCGACGTGAATAGTCCTTCAGGAGTAGCTAGACTGCCTGCTATTAATATGTTCCCTAATGGAACAGTTGTAGATATGTACACCTATCTGTCAGAAAAAGAATTTTTATATCAATTTGAAGATGCTAAGTTATTATGGAAGCTTCCAGGATTAATTTACGGTGATTGGCATGGAGGTCAAAATGGAGATGGCACATTTACTCAATCTGCGGAATTTATAGCACCAGAGAGTTTAAAAAATAATGGGTCTATATACCTCCATGTTTATGTTGTGCCAGTCGGAAAATCCCCTGATCCAAAGGATAAAGTTAATTATGCGGGACCTTACATTACATATGGAAAGAAAATGGTTAATAAATATAAAAAATTAAAATATCAAAAGACTCATAATTTATTAACCGGGCAAACGGAGAAGTCTGAGGAAGAAGTAAAAAAAGCTGAAACATTGAAAGAGGAGATTGTATCACACTGGCATCCCAACTTAACAATAAATCTGGTAACAGATCAAACTAACTGGATGCAGGGTAGTCTTACATTCCAACCTCTTAGTCTATTTAAATGGCAATTATATACCGCTCAGGCTATGAGAGATAAGTTAAACATGTTTTCAGCATTAGGTGCTGAAGAACAAGATGAGGAACAAGATACTGTCAAAGAATTACTATTAGATACATCCCCTTATTTGCTAGCATTGACTATTTCAGTCTCTATCCTCCATTCAATATTTGAGTTGCTGGCATTCAAAAATGATATTCAGTTCTGGAATAACCGACAATCTCTCGAGGGTTTATCCGTCAGATCGGTTTTCTTTAACGTTTTCCAATCGACGGTGGTACTACTTTATGTGTTAGACAATGAGACAAATGTTATGGTGAGAATTTCATGCTTTATAGGTCTGTTGATTGAGGTATGGAAGATCAACAAAGTAATGGATGTTAAGTTAAATAGAGAAGATAGGATATTGGGATTTCCGAAGTTATCATTTAAAGATAAGGGTTCTTATGTAGAATCTAGTACTAGAGAGTATGATATTCTAGCCTTCAGATATCTCAGCTGGGGCTGCTTCCCACTGCTCATTGGATATGGAGTGTATTCTTTACTATATCTAGAACACAAAGGATGGTATTCGTTTATTCTGAACATGATGTATGGGTATCTACTTACCTTTGGCTTCATTATGATGACGCCACAGTTGTTTATTAATTACAAATTGAAGTCGGTGGCTCATTTGCCATGGCGCATGATGACATACAAGTTCCTTAATACATTCATCGATGACATCTTTGCATTTGTTATCAAAATGCCAACAATGTACCGCCTGGGCTGTTTCAGAGATGACATTGTATTCTTCATATTCCTCTATCAGCGATGGATCTACAAAGTCGATCACAAGAGAGTCAATGAGTTTGGTTTCTCCGGGGAAATGGAACAGCAGAGACAAGTAGATGGGAACGGCACCCTCGCCATTGAGGGACAGGATAAAGCTGATAAGAAAAACGATTAA

Protein sequence:

>DPOGS208475-PA
MVNEENSSGDASDKLVPSDSAPRQNEMSNENVDITPQSDEQRQRQPTKMESFLAITKSLIIRGLVVYLITSMFRQPSPPKADVNSPSGVARLPAINMFPNGTVVDMYTYLSEKEFLYQFEDAKLLWKLPGLIYGDWHGGQNGDGTFTQSAEFIAPESLKNNGSIYLHVYVVPVGKSPDPKDKVNYAGPYITYGKKMVNKYKKLKYQKTHNLLTGQTEKSEEEVKKAETLKEEIVSHWHPNLTINLVTDQTNWMQGSLTFQPLSLFKWQLYTAQAMRDKLNMFSALGAEEQDEEQDTVKELLLDTSPYLLALTISVSILHSIFELLAFKNDIQFWNNRQSLEGLSVRSVFFNVFQSTVVLLYVLDNETNVMVRISCFIGLLIEVWKINKVMDVKLNREDRILGFPKLSFKDKGSYVESSTREYDILAFRYLSWGCFPLLIGYGVYSLLYLEHKGWYSFILNMMYGYLLTFGFIMMTPQLFINYKLKSVAHLPWRMMTYKFLNTFIDDIFAFVIKMPTMYRLGCFRDDIVFFIFLYQRWIYKVDHKRVNEFGFSGEMEQQRQVDGNGTLAIEGQDKADKKND-
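
The record lists a 1743 bp transcript and a 580 aa protein, which fits a coding sequence of 581 codons with one terminal stop (1743 / 3 - 1 = 580). The following is a minimal sketch of how that consistency could be checked for FASTA records like the two above. The helper names `read_fasta` and `lengths_consistent` are hypothetical, and the sketch assumes that the transcript is coding sequence only and that the trailing "-" in the protein marks the stop codon rather than a residue.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping record ID to sequence."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            records[current] += line
    return records


def lengths_consistent(cds, protein):
    """Return True if the CDS length matches the protein length.

    Assumption: the CDS contains exactly one terminal stop codon, and a
    trailing "-" or "*" in the protein marks that stop, not a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) // 3 - 1 == len(residues)


# Toy records (the first three codons of the transcript plus a stop codon),
# used here in place of the full sequences.
example = """>toy-TA
ATGGTAAACTAA
>toy-PA
MVN-
"""

records = read_fasta(example)
print(lengths_consistent(records["toy-TA"], records["toy-PA"]))
```

To run the check on this record, the full DPOGS208475-TA and DPOGS208475-PA entries would be passed to `read_fasta` in place of the toy text.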