Monarch geneset OGS2.0

DPOGS214410
TranscriptDPOGS214410-TA1707 bp
ProteinDPOGS214410-PA568 aa
Genomic positionDPSCF300069 + 119417-122478
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0128772e-12650.19% 
BombyxBGIBMGA011377-TA2e-7140.62% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|1234164383e-0922.70%viral A-type inclusion protein [Trichomonas vaginalis G3]
Group
KEGG pathway 
Orthology groupMCL25843 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214410-TA
ATGGACAGACATAACTATGAAACTGAATTGGTGAAGAAAATAGAGTTCACTGAAGACTTGAATAATGCTTACAATAAATCACACAGGAATGTATTTGATACAGAAGACATGGAATGGCGTGATCGAAGTCCAGCACCTAATATAAACATATATAAAGTATTGCCAAAGTCTAAAGACGTAAATAGCAAGGTTATAACAGAAAAAGGACTACTGAAACTATTGACAATGTTGACCAGAACGTTTAAGAAGATTATGAAACAACATCACGACATAAAGAGAATTCACAGCACTATTAACAATTTAAACAACGATTTTATCCAGAACATAAGAGATTTGACAAACAAATATGAAGATATTAATGTAAAATATTCAAAAATAATGAAAGTGAACGATGAACTTCGATTTACTGAAGCTAAAATAAAAGATAAAGAAGTCAATTACGATAAAAGGGAAAGAGAGCTGTCCAAAAACTTACGCGACTTCCAAAACCAACAGAAAAAGTTCCTTGATCAGCAAAGAAAATTTTATAACATGCAAAAAGTTATGCTCGCACAGAACGAGAAAATTAATATGAAACAGAATGTTATCGCGCAAACGCAAAGCGAAATTTCACGGAGGCAGCACCATTTCGCAAGAATTTTTAAGAAAGCCAAAGAAATTTACACAGAATCCAGAAAATTGGTTCCAAATAAATTGAACTCGGCGATCGCTAAAACTGAAACCAAGCCGGTTACTGAAACTGAGGATCCTATAACAGTTTTGACGACATCAGCACCCTCCCCAACAACTGAGCCGGTCAAAATAAACCTCTTTTCAATACCACCGATGAACAAATTAAAAAATCAAGATCAAAAAATATTAGAAGAGAAAGACGAACAGACGGTCGACGATTTAATATATAAATATTATTTCAACAACACTTTCATAGACGAGCTTATGAAAAATAAAATTCTTGCATCATTCGGAGCTGTAAATGACAACAGCGACAATAATAACAACAAGAAAAAAAGAAACGAAAGCAAATTGAGAACAACCATCCTGTTCCCCGTAAATAAAAAACATGAAAATAAAAATGAATCGAGAAGAAAAAGATGGATAAGACATGTCAACAAAAACAAAAAGAAACTTGTCCCACCGCAACCACCGGTCAGAACGAACAACCCAGTAATACCTGACCGTGGTAACCAAACAAATGTTATAGATTTAAACAACATGAATAATGATCCTTTCGTGACAATGGCCTACAATTTTTGCAAAGAAATAGGACAGAACGTCAACCTGCAAATACTTAAATGGTGTATAGAAAAAGCGTTGAGGCGATTAAAAGCTATTGATCTCGTGGCGCCATTTACAATGACGACTGTGGGTAAGAAGAAAACGGAGGAGAAATTTAGTACAACTGCCAAACTAGCTACAGATGTTAGGACAACTTTAAATAACGATGAATTAGAGAGTAAAATAAAGGAATACGAATTGCTCCCCGATCCCGAGGGAACGGTTTACTTTGACGGTAGCTTACACGCGAGTGATCTTGGATTAGTACAGAAAAATGGTGATTTAGACAGTGAGGGTTTTTCAGACATTATGCCTGGTTTGGAAAGCAACTCCAAGGTGGAAGTAGATCCTTTAGCATTTGACCTTCAGGCGCAGAGGAGAGCCAACGTTCGCAGAATAAACGAAAAAATCATAAACATGAAGCGGGGATGA

Protein sequence:

>DPOGS214410-PA
MDRHNYETELVKKIEFTEDLNNAYNKSHRNVFDTEDMEWRDRSPAPNINIYKVLPKSKDVNSKVITEKGLLKLLTMLTRTFKKIMKQHHDIKRIHSTINNLNNDFIQNIRDLTNKYEDINVKYSKIMKVNDELRFTEAKIKDKEVNYDKRERELSKNLRDFQNQQKKFLDQQRKFYNMQKVMLAQNEKINMKQNVIAQTQSEISRRQHHFARIFKKAKEIYTESRKLVPNKLNSAIAKTETKPVTETEDPITVLTTSAPSPTTEPVKINLFSIPPMNKLKNQDQKILEEKDEQTVDDLIYKYYFNNTFIDELMKNKILASFGAVNDNSDNNNNKKKRNESKLRTTILFPVNKKHENKNESRRKRWIRHVNKNKKKLVPPQPPVRTNNPVIPDRGNQTNVIDLNNMNNDPFVTMAYNFCKEIGQNVNLQILKWCIEKALRRLKAIDLVAPFTMTTVGKKKTEEKFSTTAKLATDVRTTLNNDELESKIKEYELLPDPEGTVYFDGSLHASDLGLVQKNGDLDSEGFSDIMPGLESNSKVEVDPLAFDLQAQRRANVRRINEKIINMKRG-