Monarch geneset OGS2.0

DPOGS211750
Transcript: DPOGS211750-TA, 1632 bp
Protein: DPOGS211750-PA, 543 aa
Genomic position: DPSCF300512 + 826-23748
RNAseq coverage: 985x (Rank: top 13%)
Annotation
Heliconius: HMEL006742, 1e-155, 84.18%
Bombyx: BGIBMGA011558-TA, 1e-103, 71.14%
Drosophila: prom-PD, 5e-41, 30.70%
EBI UniRef50: UniRef50_UPI00021A635C, 1e-43, 33.55%, UPI00021A635C related cluster n=2 Tax=unknown RepID=UPI00021A635C
NCBI RefSeq: XP_001656808.1, 9e-45, 32.07%, hypothetical protein AaeL_AAEL003441 [Aedes aegypti]
NCBI nr blastp: gi|157136276, 2e-43, 32.07%, hypothetical protein AaeL_AAEL003441 [Aedes aegypti]
NCBI nr blastx: gi|255652873, 8e-44, 48.39%, cuticular protein CPFL family 3 precursor [Bombyx mori]
Group
Gene Ontology: GO:0016021, 1.4e-32, integral to membrane
KEGG pathway:
InterPro domain: [1-302] IPR008795, 1.4e-32, Prominin
Orthology group: MCL12691, Insect specific

Nucleotide sequence:

>DPOGS211750-TA
ATGGTTTCAGCTGTATTCCTGATATCAGGACACGCTGAGGTGTTTGCCTGTCAAATTCTCTGGGATTCTCCACAATACAATACGTTGTCATCATTACTAAATAGGCCATCGCCTCTTCTAACAGATAATATGGGTATCTTTGATGCTTTGTTTCAAGATCTTGACAATGTTACTATAGAAGTGTCAGTCAAAGACGTTCTGAGAGAATGTGAAAAGAATGAACCCGCCTACGTTGTGTTCCAGTTAGAAAAAATACTAGACGTGAATAAGGAGACGTCTTACTTTGAGTGGGACGAACTACAAGATGACTTGAAAAGATTATCCGCTTCTATAGATGTTGGTTTCTTGAAGACGATATCAGATAACTTCAATAAGTTACTCAATAGGATTCTCGTTATATCAGACGTCAATTTGACGAAATATAGAATGGAATATAACGGACCCGTGGTAGGAAAGGATTTGCCTTCCTTTGTAGATCAACTGGAAAACGTTGCGATTCAAGTAACAGATTTAAATACAGCTGGAAGATTAGAGACTCTAGCGACGAGGACTCAAAGACTTCATTTATCGAACATAAAGCCACTAGAACATCTGCGAACAGAACTAGTGTTCAAACTGACCGAACTTGAATTGCAAATGATGCCTTTTAGGCGGAAATTAAACATCTCATTAAGTCATATCCACACAGCGCAGTTTTATATTGATAATCAAGGAGACGTCATTGCTCAAAAGAAACTGTCAACGTTCATTACCCGTCTTTTATCACACACGGCCGGTTGGCGAACGCACGTGTTAGACGCAGCGGGACAACACGCCGGTCGCTGTCGACCGCTCTATGATGTGTTCGCGGCAGTCAGATCGCTCGTGTGCACCAGATATGTTTCTTCATTGATTCCCGTTATTTTAATTATGGCCTTTTTTAATATAACGGTCATCGCTCAACAACAGTATCCACATTTGTTCACGGTGCAAACCAGAGAACCAGCTCCAATATACGAAGCCCAAAATGAAGGATTTCTAGGCCATCAATTCATCGTCTTCCTCTGCGCTATCGCCACCGCTCGTGCTGGTGGGTTGTATGGAGCATCATATGGAGCGACATATGCTGCACCTGTAGCGTCATATGCAGCAGCTCCTATCGCTACCTATGCTGCTGCCCCAGCTGTCTACAAGACCTATGCAGCCCCAGCCATACAAGCATACTCTGCTCCAGCAGTATCTTACTCAGCACCATTAGTCAAAGCGGTCGCTCCAGCTGTTTCGACTATCTCATCTTACTCAACTCACACGGCACATGGTACACCAGTGGTAGCTAAAGCTATTGCTCCTGTAGCCATCCAGGCCGCACCCGCAGTATCTTATGCTGCTGCTCCCGTTATTAAGAGTTACGCTGCACCAGCAATATCTTACGCAGCACCCGCTATTTCTTCTTATGCCGCGTCTCCTGTGATTAAGAGTTACGCAGCCCCGGCCTACCAAGCATACTCTGCTCCATCAGTATCATACGCTGCCCCCATTTCTTATGCATCCGCTCCTCTTATCAAATCTGTAACATACTCTGCAGGTCCAGCGCTCTCATACGGAAGCCTCTCAGGACACTACGGTGGACTCTCCGGTCACTACGGTTGGTGA

Protein sequence:

>DPOGS211750-PA
MVSAVFLISGHAEVFACQILWDSPQYNTLSSLLNRPSPLLTDNMGIFDALFQDLDNVTIEVSVKDVLRECEKNEPAYVVFQLEKILDVNKETSYFEWDELQDDLKRLSASIDVGFLKTISDNFNKLLNRILVISDVNLTKYRMEYNGPVVGKDLPSFVDQLENVAIQVTDLNTAGRLETLATRTQRLHLSNIKPLEHLRTELVFKLTELELQMMPFRRKLNISLSHIHTAQFYIDNQGDVIAQKKLSTFITRLLSHTAGWRTHVLDAAGQHAGRCRPLYDVFAAVRSLVCTRYVSSLIPVILIMAFFNITVIAQQQYPHLFTVQTREPAPIYEAQNEGFLGHQFIVFLCAIATARAGGLYGASYGATYAAPVASYAAAPIATYAAAPAVYKTYAAPAIQAYSAPAVSYSAPLVKAVAPAVSTISSYSTHTAHGTPVVAKAIAPVAIQAAPAVSYAAAPVIKSYAAPAISYAAPAISSYAASPVIKSYAAPAYQAYSAPSVSYAAPISYASAPLIKSVTYSAGPALSYGSLSGHYGGLSGHYGW-
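
The transcript and protein lengths in this record agree with each other: 543 residues plus one stop codon is (543 + 1) x 3 = 1632 bp. The sketch below shows one way to check this kind of consistency by translating a coding sequence with the standard genetic code. It is an illustration only, not the pipeline used to build the gene set; the function name `translate` and the choice to print the stop codon as `-` (matching the trailing `-` of the protein sequence above) are assumptions made for this example.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`, an assumption chosen to
    mirror how the protein sequence in this record ends.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First six codons and last four codons of DPOGS211750-TA.
print(translate("ATGGTTTCAGCTGTATTC"))
print(translate("TACGGTTGGTGA"))
```

Applied to the full DPOGS211750-TA sequence, the same function can be compared against DPOGS211750-PA; the two short fragments here correspond to the first six residues and the final three residues plus stop of the protein.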