Monarch geneset OGS2.0

DPOGS213853
TranscriptDPOGS213853-TA2211 bp
ProteinDPOGS213853-PA736 aa
Genomic positionDPSCF300361 - 114356-124252
RNAseq coverage708x (Rank: top 18%)
Annotation
HeliconiusHMEL0103080.075.72% 
BombyxBGIBMGA009660-TA0.064.26% 
DrosophilaCG8230-PA4e-16943.62% 
EBI UniRef50UniRef50_Q7KNA07e-16743.62%Dymeclin n=13 Tax=Diptera RepID=DYM_DROME
NCBI RefSeqXP_001986910.12e-17143.99%GH20269 [Drosophila grimshawi]
NCBI nr blastpgi|1950280863e-17043.99%GH20269 [Drosophila grimshawi]
NCBI nr blastxgi|1953798933e-16443.11%GJ21190 [Drosophila virilis]
Group
KEGG pathway 
InterPro domain[1-712] IPR0191423.2e-152Dymeclin
Orthology groupMCL12423 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213853-TA
ATGGGTATAGCAGTGAGCAAATATTCAGATATTGCAAATAATGAGTTGGTGGCTAAATTCTGCTCCAAGGAGGTTATATGTCCAAATGACCCGTTTTGGAACCAACTCCTTTCTTTCAACATACAGATACCAACTAATGCTGATGAACAGTTGATGTTTGACTCGAGCGCTGAGTCACTTCTACAGAAGTTTCTCCAGAACAATCCTCAGACGGGTAACTTCGGGTCACTGGTACAGGTTTTCATCACCAGGGCCACAGAACTGTTGGCTGCACCAAATTCCGACAATGTAATGCTCGCATGGCAGAGTTACAATGCTCTGTTTGTGGTCCGAGCCGTAGCCAAATATCTTGTCGAGTTGGTCCCCGAGTATGAAGTCTGCAAACATCTCGACGTACAAGCTGCTGTGGTACCTCCCCTGGACCAATCTCGCCCGTCCTCCCCTCCAGAGACGGAACAGCGAAGCAACGAAAGTCGCGTGGAAATGTTGATAGACGCCCTCATAGGACTCATCATTGATGTGCCAGTTACTGACAATACCTACTACCTACACATGGAATGCATCAATACGTTGTTGGTGCTGATGTCTGTGTACATGTTTTCTGGTGTACACGGGCAGTCCAAACTTGTGGAGACGTCGTTAATATATAGGACTCTGTTCCAAGGTCGCTACAGTATGCACGCATCTACCCTCGTCAAGACGCTCCTAACCAATCTCTGTCAGATGAAGCCGGCCCCGCCACAGTTCGGTGACACGGGCCCAGGCAGCATGCTCATTAATATAGCATCAAGTCTTTGGAGTATATTGACATTCGCTCCAACCGCCCGTCAAACTATTTACACCTGCACACCGCTGGACAGGACAGAGCTGATGAGCAGGTTGAAGCTGGAACATAGTGTGGCCAACCAGTCGTGTCTGTTGCTGCTAGCTCTGGCCAATCATTGCATCAACGACGACAACATGTATAGAACAGCGTTGCTTCAATGCGAGGACACTGCCGAAACATCGACCAAAGAAACTAATGGCAGTGTCACTCAGTCGACCGCACCGAGAATTGACATGGCTGCTCTCCATAAGGCTTTATGCTGTACCGCTGGCTGTGAACATTCAACTCTACTGCTGTACCTGATGCTGCACTCCTGCAAAGCCTACAAGAAGCATGTCAGTCAGGTTCCGGATATAGAAAACCTGATAGTACCGATCCTTCAAGTTCTGTACAACGCTCCTGAGAGTAATTCACACCATATATATATGTCACTTATAGTGCTGTTGATATTGACGGAAGATGACGCTCTCATTAAAAATGTTCATCTAATTCTGCATAAGGCTTTATGCTGCACCGCTGGCTGTGAACATTCAACCCTACTGCTGTACCTGATGCTGCACTCCTGCAAAGCCTACAAGAAGCATGTCAGTCAGGTTCCGGATATAGAAAACCTGATGCTAAAGAACCTTCCATGGTATACGGAGCGCTCAATATCAGAGATATCCCTCGGAGGTCTAATGGTTCTGGTTGTTGTGAGAGCCCTTCAGTATAATATGGCCAGGGTTAGGGACAAATATCTTCATACAAACTGTTTAGCTGCCATCGCTAATATGAGCTGTGAGTTCAGAAATCTACATCCCTATGTGAGCCAAAGGCTTATATCATTATTTGAAACCTTAACAAAACGGAGGACGAGGTTGTGTAGTGAAATAGAAGGTGATAGTATAGGCTTGCCGCATCATATTTCCGTGTGTGATGTGGAGAAGACAGAGGAAATTATTGAACACATAGCGGTCCTGGATGAAGTACTCCGGATGTTGTTGGAGATTATCAACTCGTGTCTGACACATCAGCTGGTGAACAACCTCAACCTGGTGTACGCTCTGCTGCATAAGAAACAGCTGTTCCAGCAGCATCAGCACTTACATATAGCCCAGAATATTGAAATGGTCATAGGATACTTTTCGACCCGTCTCCAGAGGGTGCAGGAGGGTGCCGGTGGTGATCTCGGTGTGAATGAAGTGCTGCAGTGTATTAAGAAAGGCGCTGAACAGTGGTCCAGTGATAGACTAAAGAAATTCCCCGATTTGAAGTTCCGTTATGTTGAAGAGGACAGACCCGAGGAGTTCTTTACACCGTATGTGTGGGCTTTGATCAGCGTCTGTGGTAATATATACTGGGCTAGCGAATGTGGGACGCGAGCCGCTGGGGAATTGCTGGCGTGA

Protein sequence:

>DPOGS213853-PA
MGIAVSKYSDIANNELVAKFCSKEVICPNDPFWNQLLSFNIQIPTNADEQLMFDSSAESLLQKFLQNNPQTGNFGSLVQVFITRATELLAAPNSDNVMLAWQSYNALFVVRAVAKYLVELVPEYEVCKHLDVQAAVVPPLDQSRPSSPPETEQRSNESRVEMLIDALIGLIIDVPVTDNTYYLHMECINTLLVLMSVYMFSGVHGQSKLVETSLIYRTLFQGRYSMHASTLVKTLLTNLCQMKPAPPQFGDTGPGSMLINIASSLWSILTFAPTARQTIYTCTPLDRTELMSRLKLEHSVANQSCLLLLALANHCINDDNMYRTALLQCEDTAETSTKETNGSVTQSTAPRIDMAALHKALCCTAGCEHSTLLLYLMLHSCKAYKKHVSQVPDIENLIVPILQVLYNAPESNSHHIYMSLIVLLILTEDDALIKNVHLILHKALCCTAGCEHSTLLLYLMLHSCKAYKKHVSQVPDIENLMLKNLPWYTERSISEISLGGLMVLVVVRALQYNMARVRDKYLHTNCLAAIANMSCEFRNLHPYVSQRLISLFETLTKRRTRLCSEIEGDSIGLPHHISVCDVEKTEEIIEHIAVLDEVLRMLLEIINSCLTHQLVNNLNLVYALLHKKQLFQQHQHLHIAQNIEMVIGYFSTRLQRVQEGAGGDLGVNEVLQCIKKGAEQWSSDRLKKFPDLKFRYVEEDRPEEFFTPYVWALISVCGNIYWASECGTRAAGELLA-