Monarch geneset OGS2.0

DPOGS210169
TranscriptDPOGS210169-TA3057 bp
ProteinDPOGS210169-PA1018 aa
Genomic positionDPSCF300352 + 122864-137686
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0173725e-5559.40% 
BombyxBGIBMGA013922-TA2e-3440.00% 
DrosophilaIncenp-PA4e-1728.68% 
EBI UniRef50UniRef50_UPI00020647C18e-2235.67%UPI00020647C1 related cluster n=1 Tax=unknown RepID=UPI00020647C1
NCBI RefSeqXP_001986730.13e-1729.51%GH21527 [Drosophila grimshawi]
NCBI nr blastpgi|3838557442e-2132.77%PREDICTED: uncharacterized protein LOC100880529 [Megachile rotundata]
NCBI nr blastxgi|3454847153e-4724.74%PREDICTED: hypothetical protein LOC100679418 [Nasonia vitripennis]
Group
KEGG pathway 
Orthology groupMCL21035 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210169-TA
ATGTCAATTTTTAGTGAATTGCTACCAAAACTAAATGAAATATCAAATGCGTTTACTAAAAATTTTAACGATGACCTCGAGGCCGCATTCCTCTCATTGGATAAGTTAAAGGATGAATATCTGAATTCCAAGAGTAAAAGCCGCGATAAGTCTCAAAAGGATAAGAATCATACGACAATCCTTCAAAGTATACACGAGGACGAAGATGAAACACCCAAAGCTGATGAACCGTCAGATAACACGGAGCAAAGACCTAAGAAACGTAGCAAACATGAAGTGGAAGGTATGGAGAGTCCTGAAATTGAAAAACGACAGAAACGCAAGGCATCCGTTAAAGCTCAAAGCATTATTAGCAAACAAGTCAGAGGACAAGTCAGAATTTTTGATGGAAACATCATTGTGCTATCTCCAGATGTGGCCGCATTCCTCTCATTGGATAAGTTAAAGGATGAATATCTGAATTCCAAGAGTAAAAGCCGCGATAAGTCTCAAAAGGATAAGAATCATACGACAATCCTTCAAAGTATACACGAGGACGAAGATGAAACACCCAAAGCTGATGAACCGTCAGATAACACGGAGCAAAGACCTAAGAAACGCAGCAAACATGAAGTGGAAGGTATGGAGAGTCCTGAAATTGAAAAACGACAGAAACGCAAGGCATCCGTTAAAGCTCAAAGCATTATTAGCAAACAAGTCAGAGGACAAGTCAGAATTTTTGATGGAAACATCATTGTGCTATCTCCAGATGTGGTAAATGTAAATCTAACTGAGAAACTCCGTAGAGAGAACTCAACTCAGAAGCGCAGGCGCCGAAAAGACGACGACAAAGAAAACGACCCCGAACCGACAATACGGAGTACGTACGTAAAAGAAGAGAAAATTTCACTCCCTCCCGAGCCAATAGATATCGAATCCGTGCCAGTCAATGTGGAAGTGAAGCAGGAGTTCAAAGAGGAAATAGCAATGCCGCCTCCGCCTGTACCGACCGCGCGACCCGTCCGCTCGAGTCGTGCCAGACTCACAGAACAAGAAGACAGAGAAGGAAACAGGAGAACCAGGGGGAAGAAGGCTCAAGAAACATCAACCGTTGAGGCGGAAAAGGAAACTTCTGTCGAAAACGTAACCGCGTCCCCGGCTGAGAAACCGCGGCCGAAACGTACGAGGAGAGCGAGAAAAGTTTCTGAAAAACAAGAAAACGAAGATAAAATTGAAAACGACATTGAAAAAAAGGAAAGTGATTTGGAAATAAAACCAGTAGATACGTCAGACAGCGAGGCGCGGTCGCCCGTGCTGCAGACGATAGTGCCGAAGACGAAACTCAGGGTGCGCGAGGACTGCGGAGACGACGCAGACCAGCAGGGGCGGGGCGCGCGGGCCGCGGGCGAGCAGCCGGCCGAGGGCGTGCGCGGCCTGGACTGCACCGTCACCATCAGCTGCCACATGGATGACACGGTCGTGCTGCCGCAGGCCGAGTGCCCGCGCGCGCCCGACACGCCCCCGGCGGCGAGGAAAATGAACGAGACGGTGGTGCTAGACAAGCAGAACAGAACAGAAGACAAGCGGAACGACGGGATAATGAACGAAACAGTGGTGTTGGAGAAAGGTGACTATACACAAAGAACTTCTATAATAAGGCTGCAGCTGAACCGGGCGACGACTCGCTGGGGGGCCGCGGGCGAGCAGCCGGCCGAGGGCGCGCGCGGCCTGGACTGCACCGTCACCATCAGCTGCCACATGGATGACACGGTCGTGCTGCCGCAGGCCGAGTGCCCGCGCGCGCCCGACACGCCCCCGGCGGCGAGGAAAATGAACGAGACGGTGGTGCTAGACAAGCAGAACAGAACAGAAGACAAGCAGAAGGACGGGATAATGAACGAAACAGTCGTGTTGGAGAAAGATAAGGCTGCAGCTGAACCGGGCGACGACTCGCTCTTGACTGACGACGAATCGCTGGAAATGAAGACGCCGCCTAACAGACAGCCGCCGGAGCCCACTTCCGCTGTGAAAGAGAAAGTGCAACAGTTTGAAGAAATGGCTACGAGAGTGACCCGCACTAAGACTAGAGCTATGACTAAAAAGGAGGTCCCAGTCGACCCAGACACTCAGACGCCGCCGGACAGGACGAGGCCGGTTATATCGACGGACACGCTCAGCAAGATGAACAACCTCATATTCAACGGAAAACCACCGCAGATATCATCGTCGGCGTCGAAGCCTCGTTCTAACATCCCTATGAAGACTTCGGTAACAGCCTCCGCCTCTAAGATAAGTGTCGCCAGAGACGACGAGAGAAGAGAAAAAGAGGACGCGAGAAGGAAGAAGGAGGCGATGCTAGAGGCTAAGAAGGAGATGCAACGAAGAAAGAGGGAAGAGAAAATGTCAGCGGCTGCAGCGGCTAGAACGGCGGCTGAGAACATGAGACGTGCAGCGCTTCAAGCAGCCGAGAAGGAAAGACGGGAGAGGCAGATACAGGCCGACCAGGGGAGGATGGATAGACTTAAAGAGGTCGAGAAGAAAAAGTTGGAGCAAGCACGTAAGGCTGCCGAGACAGAGGAACGAAGGAAGCTAGAGGAAGCTGCACGAGCCAGTAGACTGCAGAACGAACAAAGGAAAGTCGAGGAGGCTAGGAGGAGGCAGCTGGAGGAGGAAAAGATCATGAAGAAGGAAGCAGCTCAAATGCAGAAAGAGATAGAGCGGAGACAACGAGAGTTCATGGAGAGAATGAAGATGAAGAAATTAGAGGGAGACAGAACACCCAACAAGATGGCGGCCATAGAGCCCGTGTACATGCAGGACGGCTTCCAACACCTCAACTCCGACGAAGAAGAACCCCCGGAGAGACCACCACCAGTATGGAGCACCTCCAAGAATCGTCGCATTCAACTGTCGATCCAGTCCCGTATCAGCCAGCATCACATCGACCGTCTCTTCTCAGTGAGGGAGCACACTCCGGACCTAAGGGAGATCTTCCCTAACATAGAGCGAGCCCGCCTCAAGAGAACGTCCTCCGCCGTTTGGAGGACACCGCCCAGGCTGGCCACGCTCGACGAGTGA

Protein sequence:

>DPOGS210169-PA
MSIFSELLPKLNEISNAFTKNFNDDLEAAFLSLDKLKDEYLNSKSKSRDKSQKDKNHTTILQSIHEDEDETPKADEPSDNTEQRPKKRSKHEVEGMESPEIEKRQKRKASVKAQSIISKQVRGQVRIFDGNIIVLSPDVAAFLSLDKLKDEYLNSKSKSRDKSQKDKNHTTILQSIHEDEDETPKADEPSDNTEQRPKKRSKHEVEGMESPEIEKRQKRKASVKAQSIISKQVRGQVRIFDGNIIVLSPDVVNVNLTEKLRRENSTQKRRRRKDDDKENDPEPTIRSTYVKEEKISLPPEPIDIESVPVNVEVKQEFKEEIAMPPPPVPTARPVRSSRARLTEQEDREGNRRTRGKKAQETSTVEAEKETSVENVTASPAEKPRPKRTRRARKVSEKQENEDKIENDIEKKESDLEIKPVDTSDSEARSPVLQTIVPKTKLRVREDCGDDADQQGRGARAAGEQPAEGVRGLDCTVTISCHMDDTVVLPQAECPRAPDTPPAARKMNETVVLDKQNRTEDKRNDGIMNETVVLEKGDYTQRTSIIRLQLNRATTRWGAAGEQPAEGARGLDCTVTISCHMDDTVVLPQAECPRAPDTPPAARKMNETVVLDKQNRTEDKQKDGIMNETVVLEKDKAAAEPGDDSLLTDDESLEMKTPPNRQPPEPTSAVKEKVQQFEEMATRVTRTKTRAMTKKEVPVDPDTQTPPDRTRPVISTDTLSKMNNLIFNGKPPQISSSASKPRSNIPMKTSVTASASKISVARDDERREKEDARRKKEAMLEAKKEMQRRKREEKMSAAAAARTAAENMRRAALQAAEKERRERQIQADQGRMDRLKEVEKKKLEQARKAAETEERRKLEEAARASRLQNEQRKVEEARRRQLEEEKIMKKEAAQMQKEIERRQREFMERMKMKKLEGDRTPNKMAAIEPVYMQDGFQHLNSDEEEPPERPPPVWSTSKNRRIQLSIQSRISQHHIDRLFSVREHTPDLREIFPNIERARLKRTSSAVWRTPPRLATLDE-