Monarch geneset OGS2.0

DPOGS201133
Transcript: DPOGS201133-TA, 2964 bp
Protein: DPOGS201133-PA, 987 aa
Genomic position: DPSCF300065 - 619871-626562
RNAseq coverage: 921x (Rank: top 14%)
Annotation
Heliconius: HMEL013732, 2e-122, 67.20%
Bombyx: BGIBMGA003937-TA, 3e-173, 46.92%
Drosophila: prominin-like-PB, 2e-51, 22.32%
EBI UniRef50: UniRef50_E2AWX0, 1e-137, 35.48%, Prominin-like protein n=11 Tax=Formicidae RepID=E2AWX0_CAMFO
NCBI RefSeq: XP_001122309.1, 5e-145, 33.07%, PREDICTED: similar to Prominin-like protein [Apis mellifera]
NCBI nr blastp: gi|345486868, 1e-146, 34.62%, PREDICTED: prominin-like protein-like isoform 3 [Nasonia vitripennis]
NCBI nr blastx: gi|345486868, 4e-140, 34.64%, PREDICTED: prominin-like protein-like isoform 3 [Nasonia vitripennis]
Group
Gene Ontology: GO:0016021, 2.7e-132, integral to membrane
KEGG pathway:
InterPro domain: [78-828] IPR008795, 2.7e-132, Prominin
Orthology group: MCL10281, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201133-TA
ATGAAGGAAACGAGCGCGTCACATACAATTATAAAGTTATCTGATGTCCTTGACCTCGATGAAGCCATCGATCATGGTGCAATGGAGTTGGTAACAATAACTATAAGAAGGAAGCGCGCGACGGCCAAGGTTCGAATTCCCGCGGCCGCGTACGACACCTACATATGTAACATTATTATGGAATTGCGGACATCGCTTAAGGAATTTGTCAGATTAGGAAGAGTATTTGATGGTATCGTGTCGGTGTCTGACGGACATGTGGAGCTGGCGTCTCCTCGCTCGGAGTGGAGGTCTCTGCTGGCTCACTACGCGGGTCCCTTGGCCGTGGCTGTGCTCGTCATACTGTTCGCTGCCGTGCTGCCGCTCTCAGGGTTGTTCTGGTGCTGCTGTCAGTGGTGCAGGGTCGGGCGGCGGCGGCGGCCGTTCGACCGCAAGTATGACGCCTGTCTCAAGGGGATCCTCGCAATAGTTTTCATCGGACTGCTGACGCTGTTCTTGTTTGGCGTGGTCTGTGCCTTCGTCACCGACTCGCAGATGGAGACGGGCGCGAGCGCCGCTCCGAATGTCGTCCGTGCGGCAGTGAGGGACGCACGCACTTTCCTTGACGCCAGCGCTGGTCACGCGCGACACCTTCTCGTCGATAACTTCCGTGAACTGGAGACTCAGCTGGATCGCTCCTTGGCGGCGGGCGGCGCGGTGGTCCTGCGACAACTAGACGAGTTCACCAACGCGACGTCGGTGCGCCGCCTCGAGCTGGTTGCTTCTACGCTAGAACGTGTACCCGATGAGCTGCGCCGTGTGCAAGCCGCAACGGCCGCCCTGCGCGCCGGCGCTGACTCACTCGACGAGGGCCTGCGCCGCGTCAAAGCCTCGCTGTTCAACACGTTAGCACGCTGCCAGGAGCCTCAGTGCGTCACTCTACAGGAAAAATACAAGATCGGCCAACTCTACACCGACATTCAGTACGACAAGATAATAGATAAATATTTTCCAACGATACCGGACGTGTCGGAGCTGCTGGACAACGTGTCTCGCCTGGTGGACGGCGACATGGTCCGGGATGTGCGCGCCGGCCTCCAAGTGTTCACAGGGATCCGTCGTACCGTGGATCAGCACGCGCCACGCGTCAGGGAGGCCGTCGCTGCGACCGGGGAACGACTCGCCAGGGTAGCGGACGAAGTGTCCTGGGCGGCCGGCAACCTCAGCGAGCGGCTGCGGACCTCCCATGCCCCTGACGTACTACAAGAACACTTGCGACAGTACGGGCCGTACGTGCGACATCCCACCAGAGCTGTAGCCGCGGCGCTGTTGGCGATCGTGGTGGTGATGTCGTGGGGCCTGGTGTGCGGCGTGTGCGGGAAGCGTCCCGACGTTTACGGTGCTAGCGACTGCTGCAACAAGGGCTCTGGAGCGTCTTGTATCACATGCGGAATGGCGCTGACATTCGCAGTGGGCGGAGTGGCCGCGGTGGCGATGCTTGTTTACTTCATATTCGGAATCGCAGCGCAAAGACTCGTGTGTGACCCGCTGTCCGAGCCTCGCGGGAGTCGCGTGTTCGTAGACGTCGAGCGGTTCGTGGAGCTAGAGCGATCGCTATATAACGAGCGCTCCGACCCCGACTTCAATCTCACATCAGTGCTGGTGGACTGTCACGCCAACCGCACCATATACCACACGTTGCGTTTGCGTCGCGCCTTCGACCTGGAGTCAGTTCGTGACCGCGTGTCCTCGGACGTGTCGTCGCGTGTGTCGTCTCTCCGGACCGACTACCCTCCCCGCGGTCGCCCGCTGCGTATCCTGGCGCCGGCCGCCCGCGCTCGCCTCGACCGCCTCGCCGCTTCCGGCCTGTCCGACTTCGACTTCGACCGTATCCTGGGCGCTCTCGAGACCAACGTGACGTCACTGTCCCTGGAGGCCCTCGCGACCCAGCTGCGCTCTACGTCCCGCGCGCTGCAGTCCCGCCCCGGCTTCCGGACCGTGGCCGGAGAGCTGACCGCCGC
CGCCGATGACGTGTCCTCGCTGCACCGGGACGTCGTCGGACCCATGCTCGAGCGTACCAAAGAACTCAACAAAACGGCTTCCGAACTCCGCGACGCCCTCCGATTCAACGAGTCCTCGCTAAGGGAGGCCATCCTGTCGTGCGTGAGGGACACCAACGAGGTGGAGCTGTTCTTGAACACTCAGGGGCCCGACCTCGTCCAGAACCTGACGCGTGAGTTCGCGGAGACCCTGGGCGAGCGTCTCCAGCAGTACTTGTCGATGGTGGCGCGGGCGGCGGAGCGGGACGTGGGCCGGTGCGGGCCGCTGAGCAACGCCTTCAACGCAACACGTGACGCCGCCTGCCGAGCCTTCGTGCTCCCCGCGAACGGCTACTGGGTGTCGGTGTGTTGGTGCGCCCTGTTGATGCCCGTGGTGCTGTGTGTGAGCGCGCGCCTCGCCCGTCTGTATCGCCGCGCCGAGCCCTACCCCGGGCCGCTCGTCGAGGCGTACGTAACTCCATGCTCTTCTTTCCCTTTACCGAGTGTGTCGGGACTCCTCCGCATCGGCACACGCCTGGTCAATATCGCTGTTCGCTTAACTCACACAGCCGAGTGCATGTTCTGTGGAGGGCCCGCCCCGGAGCCTCGCTTGTCCCCTCACCCCTGTAGTGTACTGTACGCTAGTCTCCTAGCTAGTCTCCCGCCTGCGACTAGACCCGCCGAGTATTTGTATGACGCGTACACGGACCGCGATAACGTTCCCCTCGCCAACGGCTCCAAGAGAAGCACCCTCGACCTGGAGGGGAAGATCGCCGAGTGGAGAGGGAGCGGGGGAGCTACAGCCGATGGGGCGGGGGGGACGGGCACGAGGACGCAGGTGGTGCTGGAGGGGAGGGAGGTAGTCTGTCAGCTGATAGACGACCTCAAGACACGCCTTGACGCCAAAGACGAGGACAACGAGTCCGGCATACACGAGGCAGAGTAG

Protein sequence:

>DPOGS201133-PA
MKETSASHTIIKLSDVLDLDEAIDHGAMELVTITIRRKRATAKVRIPAAAYDTYICNIIMELRTSLKEFVRLGRVFDGIVSVSDGHVELASPRSEWRSLLAHYAGPLAVAVLVILFAAVLPLSGLFWCCCQWCRVGRRRRPFDRKYDACLKGILAIVFIGLLTLFLFGVVCAFVTDSQMETGASAAPNVVRAAVRDARTFLDASAGHARHLLVDNFRELETQLDRSLAAGGAVVLRQLDEFTNATSVRRLELVASTLERVPDELRRVQAATAALRAGADSLDEGLRRVKASLFNTLARCQEPQCVTLQEKYKIGQLYTDIQYDKIIDKYFPTIPDVSELLDNVSRLVDGDMVRDVRAGLQVFTGIRRTVDQHAPRVREAVAATGERLARVADEVSWAAGNLSERLRTSHAPDVLQEHLRQYGPYVRHPTRAVAAALLAIVVVMSWGLVCGVCGKRPDVYGASDCCNKGSGASCITCGMALTFAVGGVAAVAMLVYFIFGIAAQRLVCDPLSEPRGSRVFVDVERFVELERSLYNERSDPDFNLTSVLVDCHANRTIYHTLRLRRAFDLESVRDRVSSDVSSRVSSLRTDYPPRGRPLRILAPAARARLDRLAASGLSDFDFDRILGALETNVTSLSLEALATQLRSTSRALQSRPGFRTVAGELTAAADDVSSLHRDVVGPMLERTKELNKTASELRDALRFNESSLREAILSCVRDTNEVELFLNTQGPDLVQNLTREFAETLGERLQQYLSMVARAAERDVGRCGPLSNAFNATRDAACRAFVLPANGYWVSVCWCALLMPVVLCVSARLARLYRRAEPYPGPLVEAYVTPCSSFPLPSVSGLLRIGTRLVNIAVRLTHTAECMFCGGPAPEPRLSPHPCSVLYASLLASLPPATRPAEYLYDAYTDRDNVPLANGSKRSTLDLEGKIAEWRGSGGATADGAGGTGTRTQVVLEGREVVCQLIDDLKTRLDAKDEDNESGIHEAE-
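
The transcript and protein lengths above are consistent with a transcript made up of coding sequence only: 2964 bp = 3 x (987 aa + 1 stop codon). The following is a minimal Python sketch of that check, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the coding codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six and last four codons of DPOGS201133-TA, copied from the record.
print(translate("ATGAAGGAAACGAGCGCG"))
print(translate("GAGGCAGAGTAG"))
print(lengths_consistent(2964, 987))
```

Passing the full DPOGS201133-TA sequence (with any line breaks removed) to `translate` allows a direct comparison against the DPOGS201133-PA record.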