Monarch geneset OGS2.0

DPOGS212202
TranscriptDPOGS212202-TA1788 bp
ProteinDPOGS212202-PA595 aa
Genomic positionDPSCF300323 - 117268-120010
RNAseq coverage5611x (Rank: top 2%)
Annotation
HeliconiusHMEL0166920.069.81% 
BombyxBGIBMGA001168-TA0.063.55% 
DrosophilaCapr-PA5e-4043.78% 
EBI UniRef50UniRef50_Q2F6950.063.81%Glycosyl-phosphatidyl-inositol-anchored protein n=2 Tax=Obtectomera RepID=Q2F695_BOMMO
NCBI RefSeqNP_001040301.10.063.81%glycosyl-phosphatidyl-inositol-anchored protein [Bombyx mori]
NCBI nr blastpgi|1140514830.063.81%glycosyl-phosphatidyl-inositol-anchored protein [Bombyx mori]
NCBI nr blastxgi|1140514830.066.03%glycosyl-phosphatidyl-inositol-anchored protein [Bombyx mori]
Group
KEGG pathway 
Orthology groupMCL17952 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212202-TA
ATGCCTTCAGCTGCGAATGCAAAGTCTGAAAAACCAGCTTCTGCGGAAGCTACGGATAATTCACCAATACGACAGATAATGACCATTATTGAACATAAAATCCGGAACTTGGAAAAAAGGAAGGGCAAATTAACATCGTACCGTGATTTGCAAAAAGCCGGTAAGGAATTAAACAGTGATCAAAAGGAAGCGGTGGCCAAGTATGATGAGGTGGTTATAACCCTAGAGTTCGCTAGAGACCTCTCGAAACAAGTGGCTTTGATAGCTATAGCGTCAGAACGTGAAGCCAAGAAACAAGCTAAGAAGGATGCCTGGGTTCGTTATACGGCAGACACGAATAAAATACGTGAGGTACTCCTTATATTGGACTGCCTCATGCAGCTGGGTAACGCGGAAGTTAGAGACGACTTCCTTAATGGTACCAATGGTGCAGTGAAATTAACTGAGGAGGATTTAAAAATCTTAGATAACTTGTATCCTGAAGTGACCCCTAAACATGAAGTGAATGAAGAAGGCCAGCCTGGTTTTCATGCGTATACAATAAAAGCTGCGGAACATCTGTATGCTATTATTGATGGAAAACCCAAAGAAGTCCTCGGCACCACATATGCCCATGTCAAGGAAATAGTCACGGCCGTTCATGAGTGTGGCTACTTTGATAAATCAGCTGATGTTATCCCTGAACCAGAGGAGATACAGAATGTGTGTGAAGAAGCACCAGAACAGACGGAAGAAATTGAGGAGTCTGAGCCTGCTCCAATGTATGTGGCCCCTGTCCCTATGTCAGCCCCACCGGCAGCCGTAGTTCCTGCCCCAGGATACCCACTAAGACCGATCCCCCCCATCACCCTTCAAGAAATTGAGAATGCTTATTTCTCACAACAATATCCCCAACAAAGGCCTATATCTGAAGTGATAGGCTCCCAAAACTTCTTCTTCCTACAAGAATCTGAGATTGACAGTCCTGTAGGAACTCCACAACCTCCACAGATCATGAACCAGCCTTCTCCGCCAGGACCTATCCCAACACAGACATTTACTAATCAACACTTCGTCCAACTGCCTGGAGGTCGTGTTCCTGAACCTGGAACCATTCCAATGCCACCTCAACCTCACTTCCCCCCTCATCCAGACCACGCGGCTTATCAAGTTCCGATTCCTCAAATCCATCCCCAACCTCACCACGTTCCTCAGTCAATCCCTCAACCGATGCAACACAACCAACCGATCCCACATATCGAGCAACAACCCAACTTTGAGCAGGAAGAATTGAAAGTTATATCACCCGTCGAAGACAAACCGGAAGACGAAGTAAAAGAAAGCGCCAGCCCGGAAAGAGAAGATGGAGGGGAACGAAAAACACAAGGGCAGGGGGACGGACAGAACCGGTTTAGACGGTACCGGGGTAACGGGAGGGGCTCGTCCAACGGATTCAGAGGTCGAGGAGCTTACACGAACAGGCAGAGCGAAGGATACCACCCTAGACACAACGACTACCAGAACAGGAGTAACAAAGAGAGTTATCAGAACCGCCAGTACAACGACGGCTACCAGAACAGGCACGGCAAAGACAACTACCAGAACAGAAGCGGAAATGACTACTACGGCAACGGAGACAACGGGGACTCCCATCATAACGAGAACGGCTTACGTTATAACGACAACTCCAGCTACCAGCCAGGGTTCAAGAGCCGCGGCAGAGGAGGCCCCCGCGGAGGAACACGGGGCGCTCCACGCACGCCGCGCACGAACCATCAGTACAACCGTAAACAGGAAAACGTGGAATAG

Protein sequence:

>DPOGS212202-PA
MPSAANAKSEKPASAEATDNSPIRQIMTIIEHKIRNLEKRKGKLTSYRDLQKAGKELNSDQKEAVAKYDEVVITLEFARDLSKQVALIAIASEREAKKQAKKDAWVRYTADTNKIREVLLILDCLMQLGNAEVRDDFLNGTNGAVKLTEEDLKILDNLYPEVTPKHEVNEEGQPGFHAYTIKAAEHLYAIIDGKPKEVLGTTYAHVKEIVTAVHECGYFDKSADVIPEPEEIQNVCEEAPEQTEEIEESEPAPMYVAPVPMSAPPAAVVPAPGYPLRPIPPITLQEIENAYFSQQYPQQRPISEVIGSQNFFFLQESEIDSPVGTPQPPQIMNQPSPPGPIPTQTFTNQHFVQLPGGRVPEPGTIPMPPQPHFPPHPDHAAYQVPIPQIHPQPHHVPQSIPQPMQHNQPIPHIEQQPNFEQEELKVISPVEDKPEDEVKESASPEREDGGERKTQGQGDGQNRFRRYRGNGRGSSNGFRGRGAYTNRQSEGYHPRHNDYQNRSNKESYQNRQYNDGYQNRHGKDNYQNRSGNDYYGNGDNGDSHHNENGLRYNDNSSYQPGFKSRGRGGPRGGTRGAPRTPRTNHQYNRKQENVE-