Monarch geneset OGS2.0

DPOGS204326
Transcript        DPOGS204326-TA  3414 bp
Protein           DPOGS204326-PA  1137 aa
Genomic position  DPSCF300142 - 127249-137838
RNAseq coverage   93x (Rank: top 62%)
Annotation
Heliconius      HMEL002323        3e-131  53.52%
Bombyx          BGIBMGA000001-TA  1e-124  55.78%
Drosophila      phtf-PA           2e-101  42.22%
EBI UniRef50    UniRef50_Q16RV6   3e-104  43.52%  Putative uncharacterized protein n=2 Tax=Culicinae RepID=Q16RV6_AEDAE
NCBI RefSeq     XP_001661074.1    5e-105  43.52%  hypothetical protein AaeL_AAEL010838 [Aedes aegypti]
NCBI nr blastp  gi|307193677      1e-104  37.85%  Putative homeodomain transcription factor [Harpegnathos saltator]
NCBI nr blastx  gi|157127528      2e-112  38.35%  hypothetical protein AaeL_AAEL010838 [Aedes aegypti]
Group
KEGG pathway 
Orthology group  MCL10846  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204326-TA
ATGGTGGAACAGCTCCTGATGAGAGGAACATACAGACGTCGGATCATAGACTTCACATCCCATCCAAGATCATATCTGATAGATGTTGATCTAGTCAGAGGCAAGTTGAAAAACATTATAACCGCTTCAATGTCCTATTCATGGATAGTGCCTCTTGGTCTTATGATAGTCCTAAGTATAGTGCATTCTCAAATAGTCAGCACTACTGAAATGGAACTCGCCAGAGTAAGACCTCGTTCAGCATATCGGAGGAAACTACCTAGAAAAACCCGATCAGACTGCGATGGCGGTAGTGGCGGTGGATCTGCTGAAAGTGGTGCAACTTCAAGCCACAGCAAAACTCCTCTCACAAAATCATCAAAATTCAGGCGAAGGCTCTCGAGATCTGATTCAACTGATACAGGGATGCGGAAACGTAAAAAGGAAGTGACCAAAAATAATGACACTGTATTACATGCGTGCAAAGAAAAAAATACTGCGAAAGCAAAAGTTAAAGATTCAGACGACGAGGATTATATGTCCTGGAAAAAACCAGAAGAAACAGCGCCAGTGGTGACGTTCACTCCACCAGCTGACGAGGGGAACACCAAACGTACGTTCCGATACAAACCGAACATATTGACAAAGAAATACTTAGAATTCTTCAATGTACGCCAGAATTTAAATAGACCTATCTTCGCTGACGGTGACGACGGTTTTGAGAGTCTCAACGGCTACAACTCCCACGGCAGCGACGGAGAAATCAGGAACAGAGACACGGATAGGAAGCCACGCGAGCAAATAAAAGAAAAGCCGGCAGAGGAAAACGATCCAAAAATAATAGCGTCGGAAGAAAAAGCAGAAAGCGCGAAAACAAGCAAAGATGAAGAAGACAAATTTGTGGACCATGAATCAGATAGCGCCACAACGAATCACGGCAAAAGGGTCGGCGTGAGATTCAGGAAATCTTGGGCCAAAAACTCCGTCCACGAATCAACGGACGAAGATTACAATCTTAAAGCCAAACAAAAGAAACTAAATAACTACCAGAGTTCGTCATCGGACGGTGAGTGTTCGGCTTCAGCGCCATCTATCGCTTTACCGTCACACCATACTATGTCAGACTGGGTTGGCCAAATTACTAACAGTGAAGAGAGCAGTTACGGATCCCAATCCGAAGCCGGTCACTCCGATGTGTTTCATTACACAGCCGACAGCTCTTGGGATCCGTTCGCTATTTTGGATCCTTCCAGCGACACTGATTTCATAGCCCCGGTGTCTTTGGACATTGATTCCTATTCATGGATAGTGCCTCTTGGTCTTATGATAGTCCTAAGTATAGTGCATTCTCAAATAGTCAGCACTACTGAAATGGAACTCGCCAGAGTAAGACCTCGTTCAGCATATCGGAGGAAACTACCTAGAAAAACCCGATCAGACTGCGATGGCGGTAGTGGCGGTGGATCTGCTGAAAGTGGTGCAACTTCAAGCCACAGCAAAACTCCTCTCACAAAATCATCAAAATTCAGGCGAAGGCTCTCGAGATCTGATTCAACTGATACAGGGATGCGGAAACGTAAAAAGGAAGTGACCAAAAATAATGACACTGTATTACATGCGTGCAAAGAAAAAAATCCTGCGAAAGCAAAAGTTAAAGATTCAGACGACGAGGATTATATGTCCTGGAAAAAACCAGAAGAAACAGCCCCAGTGGTGACCTTCACTCCACCAGCTGACGAGGGGAACACCAAACGTACGTTTCGATACAAACCGAACATATTGACAAAGAAATACTTAGAATTCTTCAATGTACGTCAGAGTTTAAATAGACCTATCTTCGCTGACGGTGACGACGGTTTTGAGAGTCTCAACGGCTACAACTCCCACGGCAGCGACGGAGAAATAAGGAACAGAGACACGGATAGGAAGCCACGCGAGCAAATAAAAGAAAAGCCGGCAGAGGAAAACGATCCAAAAATAATAGCGTCGGAAGAAAAAGCAGAAAGCGCGAAAACAAGCAA
AGATGAAGAAGACAAATTCGTGGACCATGAATCAGATAGCGCCACAACGAATCACGGCAAAAGGGTCGGCGTGAGATTCAGGAAATCTTGGGCCAAAAACTCCGTCCACGAATCAACGGACGAAGATTACAATCTTAAAGCCAAACAAAAGAAACTAAATAACTACCAGAGTTCGTCATCGGACGGTGAGTGTTCGGCTTCAGCGCCATCTATCGCTTTACCGTCACACCATACTATGTCAGACTGGGTTGGCCAAATTACTAACAGTGAAGAGAGCAGTTACGGATCCCAATCCGAAGCCGGTCACTCCGATGTGTTTCATTACACAGCCGACAGCTCTTGGGATCCGTTCGCTATTTTGGATCCTTCCAGCGACACTGTGAAATGTACAATGTGGGAGCGTGGTTGTACTCTGCGCGCTGAATTGTCAGCTGTTGATATAAGTTGGTACGTGGTGGCTCGGGCGGAGCGCGCTATGTCCGACGGCGGGGTCTGGCCGGGGCTGTTCATGGCGAGCCTAGTGGCTGTAGTGTCACCCTTTATGAGACTTGTACAGGTGGCTATAGAGAAGGACACGCGCAGTGAAGATGAGCTGCAGAACATTTCTCTCATCAGCTACATTCCATCTCTTGTGGTGAACTATACCCAGGGCTCGATGGTTTGCGTTTTCAACGGAGCTCTCGGAGACAGCTTTTGGGAGATATCCTCGAACGTACTATCATGTGTATTACGTTTCGCTCTAAGCGCTCTAGTGTTCTTCCTCCTGGCGGTCGCTGAGCGCGCCTACAAACAGAGATTCCTTTACGCAAAGCTTTTCTCGCATCTAACGTCGGCGAGGCGAGCAAGGAAATCAGAATTGCCGCATTTTAGATTAAATACAGTCAGAAATATAAAGACGTGGCTGTCAACTAGATCATATCTGCGGCGTCGTGGACCGCAGAGGTCGGTTGATGTGATAGTATCGGCTGCTTTTATGTTGACCCTCACATTACTTGCTTGTGTCAGCGCACAACTATTAAGGGACTCGGTTACTCTTGAGAGGGGCTGGTTGTTAGAAGCTATGGTTTGGAGCTGTTGCCTCGGTATATATCTCCTTCGTCTGCTCACCCTCGGCAGTAACGTGAACAGGAAGTACCGCGGATGTCTCTCAGCGATACTCACAGAACAGATCAACTTACATCTGGCGATAGAACAGCGACCCGAGAGCAAAGAACAACTCACCGTAGCCAACAATGTCCTTAAATTGGCCGCAGATTTGCTAAAGGAATTGGATTCGCCGTTTAAGATATCAGGGATATGTGCAAATCATTATCTCTACACCATAACTAAAGTCGTGATACTCTCCGCGCTGTCTGGAGTCTTATCTGAAATGTTAGGATTTAAGTTGAAATTGCACAAAATTAAAATTAAATAA

Protein sequence:

>DPOGS204326-PA
MVEQLLMRGTYRRRIIDFTSHPRSYLIDVDLVRGKLKNIITASMSYSWIVPLGLMIVLSIVHSQIVSTTEMELARVRPRSAYRRKLPRKTRSDCDGGSGGGSAESGATSSHSKTPLTKSSKFRRRLSRSDSTDTGMRKRKKEVTKNNDTVLHACKEKNTAKAKVKDSDDEDYMSWKKPEETAPVVTFTPPADEGNTKRTFRYKPNILTKKYLEFFNVRQNLNRPIFADGDDGFESLNGYNSHGSDGEIRNRDTDRKPREQIKEKPAEENDPKIIASEEKAESAKTSKDEEDKFVDHESDSATTNHGKRVGVRFRKSWAKNSVHESTDEDYNLKAKQKKLNNYQSSSSDGECSASAPSIALPSHHTMSDWVGQITNSEESSYGSQSEAGHSDVFHYTADSSWDPFAILDPSSDTDFIAPVSLDIDSYSWIVPLGLMIVLSIVHSQIVSTTEMELARVRPRSAYRRKLPRKTRSDCDGGSGGGSAESGATSSHSKTPLTKSSKFRRRLSRSDSTDTGMRKRKKEVTKNNDTVLHACKEKNPAKAKVKDSDDEDYMSWKKPEETAPVVTFTPPADEGNTKRTFRYKPNILTKKYLEFFNVRQSLNRPIFADGDDGFESLNGYNSHGSDGEIRNRDTDRKPREQIKEKPAEENDPKIIASEEKAESAKTSKDEEDKFVDHESDSATTNHGKRVGVRFRKSWAKNSVHESTDEDYNLKAKQKKLNNYQSSSSDGECSASAPSIALPSHHTMSDWVGQITNSEESSYGSQSEAGHSDVFHYTADSSWDPFAILDPSSDTVKCTMWERGCTLRAELSAVDISWYVVARAERAMSDGGVWPGLFMASLVAVVSPFMRLVQVAIEKDTRSEDELQNISLISYIPSLVVNYTQGSMVCVFNGALGDSFWEISSNVLSCVLRFALSALVFFLLAVAERAYKQRFLYAKLFSHLTSARRARKSELPHFRLNTVRNIKTWLSTRSYLRRRGPQRSVDVIVSAAFMLTLTLLACVSAQLLRDSVTLERGWLLEAMVWSCCLGIYLLRLLTLGSNVNRKYRGCLSAILTEQINLHLAIEQRPESKEQLTVANNVLKLAADLLKELDSPFKISGICANHYLYTITKVVILSALSGVLSEMLGFKLKLHKIKIK-
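
The transcript and protein records above can be cross-checked against each other: 1137 codons plus one stop codon give 3 x (1137 + 1) = 3414 bp, which matches the listed transcript length. The following is a minimal sketch of that check in Python, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The function names `translate` and `expected_cds_length` are hypothetical and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    following the protein record above). Codons containing characters
    other than A/C/G/T are written as "X". Trailing bases that do not
    form a full codon are ignored.
    """
    # Remove whitespace so that a sequence wrapped over several lines works.
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Nucleotide length of a CDS encoding `protein_length` residues plus a stop."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First ten codons of DPOGS204326-TA, copied from the record above.
    print(translate("ATGGTGGAACAGCTCCTGATGAGAGGAACA"))
    print(expected_cds_length(1137))
```

To check the whole record, pass the full DPOGS204326-TA sequence to `translate` and compare the result with the DPOGS204326-PA sequence, including its final "-".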