Monarch geneset OGS2.0

DPOGS202390
Transcript: DPOGS202390-TA, 1776 bp
Protein: DPOGS202390-PA, 591 aa
Genomic position: DPSCF300515 - 34841-47376
RNAseq coverage: 426x (Rank: top 29%)
Annotation
Heliconius: HMEL014272, 4e-134, 56.61%
Bombyx: BGIBMGA001772-TA, 4e-116, 49.46%
Drosophila: CG7518-PF, 9e-07, 55.81%
EBI UniRef50:
NCBI RefSeq: XP_001844141.1, 5e-08, 53.19%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp:
NCBI nr blastx: gi|194353937, 6e-13, 29.03%, uncharacterized protein LOC792544 [Danio rerio]
Group
KEGG pathway:
Orthology group: MCL19871, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202390-TA
GTGTGTACTCAGCTGGTGGGCGGCCGAGCGCCCGCTCCCCCGCCGCCCGGGACTCCGCCCCCCGCCCCCGCACACGCGCCCTGCAGGCCCCTCTGTCCGGAGTACCACCTGCAACGGTTTAAGGAGGTGCGTCGGAAGGTGATGCATATGAGAGACCAACAGCTGTACTACCACAGACTGAGGATTTGGCTCGAAGACCAGCTCAGGGTCGAGAGGGAGAGCAAAAAAGATAAAACGGAACCTCTTAAAGTAGACAATAATAAAAAGAAAGATAAAGAAAAAGATAAGAAGGAGGAGAAGAAAGAAAAGAAAGAAGAGAAGAAAGATAAGAAGAAAGAAGATAAGAAAGGAAAGGAAAATAAACAAGAAAAACCTGCACCAGCTAAAAGTAAATCTCAAACTAAGAGAGATAAGAAGGGTCAGCCAGCTAATCCAACTGGTCATTCAGCGGGTCAGGGTCAAGTCATCAACATAACGCCGGACACAACCATAGAGGTTGTTAGCAAAAAGGTCGTGAGCAGCGCTGAAGCCGACAAGCCCTCGTCTTGTAGCATTATGGAACAACTGAGTAGTGGAGTCCAGGTCGCTGATCTCAAGCTTCCACCGGGTATAACACTGACCAGGGTCCAGCCCGGAGAGAAGAAAGATCCGCCTGTTATAAAGACACTTGCCCCGCCTCGTCGACCGATAGTCGCTCCCACCGCCATAGAACCACCCAAGATAATAATACCAGAAGCGACGACGAAAACAAAGAAAGGCAAGAAGAAAGCGAAGAAATGTGACGGCGAAGTGAAGCCGGAAGCCAAGATGGTGACGCTCAGGAACCCCATGTTCCATCCCAACCTGCCGCCGGTGCAAGTGCAGCCGCAGAAGAACGAGATACGGGTCCCCGAGCCCATCCCCATGCCGCCCGCGCCCTGCCAGGCCACCATCACGCCGACCTCCAACGGCATGTACACTATACGGAACCCGCTCATGTCCATGATGCACCAGCAGAACCTCATCCGGACCCAGAACCCGCAGGCCTACCACTACAACTACATCAACAGCTACGACCAGAAGCCGGAGACGGAGCAGAACTTCCACAACAGGATCATGAATCTCGCGTCGTTCACGCAGAAGAACGACGAGGGCTACTCGCTGTTCAAGACGAACAGCAGCCAGGAGAAGAGCTTCCTCAGTCCCGAGTACTTCGACACGCCCAAGGTCTCCCCGAACCCGATCGGCACCAGGCCGGGCTACGAGGACTCGCTGTTCACCACGGCGATCCCGGAACCGATCGGGCCGCCGCTCAGGAAGGAGGACTACAGGGGCTACACGCCCTTCGGCCAGGAGGACAGGAACGTGTTCAGGAACGCGCTGTTCAGCGACAAGCCCGATAAGGCCGATCACATGAACGGCGAGCTGCCGTACTTCCAGCGGCTGCGGGCGGGCGCCAAGCTGAACAACGAGGTCACCGTGCACTACGTGACGGATTCGAAGCTGTACAAGGGGCAGGAACACAAAGAGCCGGATTCCTCACTCTTCTCCCCGTGGCCCAATAACTGCCAGCCGGCGATGAACGAGACAACAGAAGAGGCGTCGCAAGCGCACAGACCCGAGTCCAGCGTGTTCCTCCCAGACCGGTCTTTAACGAACCTGTCCAGTCTAGACGCGTCTGAGAGAGACATAGAATCCTTCAAGAGGTTCGATTTCTTCTTCGACCCGCCGCAGTACAAGCCCAAGGTCCAGCTGGACGTGCGCAACATAGCGGACACCATGAGACACAACAAATAG

Protein sequence:

>DPOGS202390-PA
VCTQLVGGRAPAPPPPGTPPPAPAHAPCRPLCPEYHLQRFKEVRRKVMHMRDQQLYYHRLRIWLEDQLRVERESKKDKTEPLKVDNNKKKDKEKDKKEEKKEKKEEKKDKKKEDKKGKENKQEKPAPAKSKSQTKRDKKGQPANPTGHSAGQGQVINITPDTTIEVVSKKVVSSAEADKPSSCSIMEQLSSGVQVADLKLPPGITLTRVQPGEKKDPPVIKTLAPPRRPIVAPTAIEPPKIIIPEATTKTKKGKKKAKKCDGEVKPEAKMVTLRNPMFHPNLPPVQVQPQKNEIRVPEPIPMPPAPCQATITPTSNGMYTIRNPLMSMMHQQNLIRTQNPQAYHYNYINSYDQKPETEQNFHNRIMNLASFTQKNDEGYSLFKTNSSQEKSFLSPEYFDTPKVSPNPIGTRPGYEDSLFTTAIPEPIGPPLRKEDYRGYTPFGQEDRNVFRNALFSDKPDKADHMNGELPYFQRLRAGAKLNNEVTVHYVTDSKLYKGQEHKEPDSSLFSPWPNNCQPAMNETTEEASQAHRPESSVFLPDRSLTNLSSLDASERDIESFKRFDFFFDPPQYKPKVQLDVRNIADTMRHNK-