Monarch geneset OGS2.0

DPOGS215581
Transcript        DPOGS215581-TA  3858 bp
Protein           DPOGS215581-PA  1285 aa
Genomic position  DPSCF300097 + 133098-141105
RNAseq coverage   1499x (Rank: top 9%)
Annotation
Heliconius        HMEL010067        0.0     65.26%
Bombyx            BGIBMGA010991-TA  0.0     70.56%
Drosophila        CG4266-PB         4e-64   75.17%
EBI UniRef50      UniRef50_D2CG38   3e-122  43.25%  Putative uncharacterized protein GLEAN_10693 n=1 Tax=Tribolium castaneum RepID=D2CG38_TRICA
NCBI RefSeq       XP_967119.1       5e-123  43.25%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp    gi|91094229       1e-121  43.25%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx    gi|91094229       1e-167  37.23%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology     GO:0003676  2.5e-16  nucleic acid binding
                  GO:0000166  2.9e-16  nucleotide binding
KEGG pathway 
InterPro domain   [3-142]    IPR008942  4.1e-39  ENTH/VHS
                  [8-138]    IPR006569  1.5e-37  RNA polymerase II, large subunit, CTD
                  [414-481]  IPR000504  2.5e-16  RNA recognition motif domain
                  [411-487]  IPR012677  2.9e-16  Nucleotide-binding, alpha-beta plait
                  [61-125]   IPR006903  1.3e-15  Domain of unknown function DUF618
Orthology group   MCL18367  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215581-TA
ATGGATATGGCGGAGGTCAAAGCGTTCAACGCGGAGTTATCCGGGTTGTACGAGAATCGGCCTCCGATTTCCAAGGCAAAAATGAGTGCTATCACTAGAGGTGCTATCAAAGCCATAAAATTCTACAAGCACGTCGTACACAGTGTGGAAAAGTTCATACAGAAGTGTAAACCTGAATACAAAGTTCCTGGTCTGTACGTGATCGATTCAATAGTAAGACAGTCACGGCACCAGTTTGGCCAGGACAAGGACGTGTTCGCGCCAAGGTTCGCCAAAAACATGCAACAGACATTCGCTAACCTGTTCAGGTGTCCTGATGAAGATAAGCGCAACATAATCAGAGTTCTGAATCTGTGGCAGAAGAATAATGTGTTTGGACCTGAAGTGATCCAACCACTGCTAGATCTGGCCGACCCAAGTCATCCTTTGCATTTGGAAATACAGAACCAGAATAACACAACCAATGGAAGCATAAACATGAGTCATAACACATCAGACAGCAAGATCTCCCCCGCTCGGCAGGACTCCCCACAGACATCCTCGCCCATGGGAGATGCATTTCAAGATGACTCCTCGCCTGGTCCTCAAGCCAAATTCAACCGCAAGCTCCTCAATGATTTTGAGTATGAAAGTGAAGATGAACAAGAACCTCCCCCGCAGCCACCACATCCGCCACATCCGCCACACGCGCCACATCCGCCACATTCGAGCCACGCCACACACACGACACACAACCCCACAGATGCTCTTGGCAGTATACTAACAAATCCAGAAATTATGAGGCAGCTACAGAGTCTACAAGCCCAGATGCAGCTCATGACGGGCATGCAGATCCCAAATTTGATGCCGATGATGTCAGACATGCAACTTCAGCAGAACCAAAATTCGAACGCACCATTCTTAAACTCTCAGACAGAACAACAGAAACCCGCAGAGCCCAAGGAGGACCTCGCCAACGAGTCCGACATAGAGTTCGTGGAGACCGGACCCCAGGTCATCGAGATACCCGACGCCAACGACTCCCGGTCGCCCTCGCCGAGGCGACGACATCGCTCCCGGTCGAGGAGTCCACGCCGGAGGAGGAGGACGCGCTCCCCGCGGAGAAGGAGGGACAAGGACCGCGATCGAGACCGCGACAAGACGCACAAGGAGAGGGAGGCCGAGAAAGAGCGCCAGCGGGAGAGGGAGAAGAGGGGCCTACCACCCATCAAGAAAGAGAACCTCAGTGTGTGCAGCACCACGCTGTGGGTCGGCCGGCTGTCCAAACAGGCCACGCCCGAGGAGCTGTGGGACCTGTTCGGGGCCGTGGGCGGCGTGGCGGCCGTGGACGTGGTGGCGCCTCGGGGCTGCGCTTTCGTCGTCATGGAGCGGCGCCGGGACGCCGCGCGCGCGCTCGCCAAGCTGCACCGACACAAACTGCACTCCAAAGAGATAGACGTCGCCTGGGCGGCCGGCAAGGGCGTCAAAGGCCGCGAATGGAAGGACTACTGGGAGGCGGAGCTCGGGGTCGCTTACTTGCCCTGGAGCGCCCTCCACGCGCGCTGGCTGCTGGGCGCGCTGTCGCTGGACGCGCTCGAGGACGGCGGGGCCGTGGATGAGGACACGCTGCCGCCTTGGCTCCCGCCCAGGATACTGCCTAAGTCTGTCGGGGAGGCCGTGCCGTTGATGGGCGCGCTGCCCGCTCCGCTGCCGCTGCCCACCGGGTTGCCGCGTCTGCCGCCACCCGGCCTGGGCGCTCCGCCACCCGCCGCGGGCTACCCCGGCCTGGGCTCACTGGCGCCGCACCAGCTGCTGAACGAGTCGCCGGGCTCTTCGGCGCCGGGTCTGCAGCGCGACCCGCTGCTAGCCTTCCCGCCCGCCCTGCCTCCCCACACCATGCCGCAGCCCGGCTTGGTGGGTGGCTTCCTGGGCGGTCTGATGGGTGTCGGAGTGGGACACATGAACGTCGGCGGGCTCGTGCTACCCCTTCACCCGGCCCACGCCCACGCGGCGCACTCCCA
CGCGCAGGTCCACACGCATGCACACCCGCACGCCCCGCCACACGTCCCGCCGCACGCACTCGTCCCTCAGGTGGGTCAACGAGCCGAGGTGGCGGATGACGCCATGGAGCTGGACAATGACGACCAGACGGACGAGCCCCCAGCCCCCGCGGCCCCTCCGGCCCCAGCGCCGGCGCCCGCACTCGGTCTTCCGCCTCCCGCCGTGCCCCCGCCGTTGTCTATGGACCAGCTTCAGGTCCTGTTGTCGAAGCCGCCGCCGACTTTCAACTCCGCGGAGCCTCCGCCTGGGTTCAATCCGGAGTCTTTCGAGACGGAGGAGACTCCGGACGAGCGCCGCGAACGGGACAAGGAGCGACGGGACAGAGACCGGGACAGGGACCGGGACCGACGGGACCGGCGCGACGACCGACCGGACCGACCCGGGGGGCGCAGGGAGAGGGACCGGCCGCGGGACAGGGACAGGGAGAGAGATGAGCGCCGGGAGAGAGACCGAGGAGGACGGGAGAGAAGAGACCGGGACAGGGACAGGGAGAGGGACAGGTTCCCCAGGGAGAACAACAACGAGAAAAGTCAGAAGTCTCCACGGAGTCAGGCCGGCGAGGCGGGCGGCGCGGAGAAGACGCTGCAGGAGAGGCTGTGGGAGATGGCCAACGGGAAGACGAGCGACGGAGACGAGCTCGAGCCCCGAGCGGACAGGCCTCCGCTCATAGAACGACCGCCTCTCATGGAGCGGCCGCAGACAGCGGACAGCAAGGTTCGTCTCCGCGGTCCCGGCGGAGGGGGTGGTCCTCGTCCGCCGCCGCGCGCACCGTGGCTGGCTCCGCGCTTCAATGGTTTGGGTCCGCCTTTCGTACGTCCTCCATTCGAGAGGCCCCCGTTTGAGGGTCCTCCGATGTTCGAGAGGCCGCCGTTCGGCCGGATGCCGTTCGACGGCGCGCGGCCTCCTTTCGACGGTCCCCGGCCTCCGGGTCCACGCCTGCCCTTCGATGCGCCGAGGCCTTTCGACGGGCCGCGCCCCCCATTCGATGTACCGCGCCCTCCCTTTGACGTTCCGAGGCCACCGTTCGAAGGTCAGCGACCTCCCTTCGACGGACCGCGACCTCTCTTCGACGGTCCGAGGCCTCCGTTCGATGGTTTCGAAGGAGATAGATCATTCGACGGACCCAGATTCGATGGGCCCCCCGAGTTCTTCGACAGAGGCAACAGAAGATTCGATGATAGAGATTTCAACGAGAGAGGCTGGAACGGAGATAGAGACTTCGACCGAAGGACAGAATGGGAGGACAGGAGGAGAGAACGCAGAGGGAGAGATAACGAGGAACGGTTCAGGGAGCGAGGGGGGAGGGGAAGAAACTACGACGAGAGAGCGAGACCGAGAGACGAGAGGAACACCCGGAGAGACAAGGACAGGAAGTCGAGATGGGGAGCGGCGGACGAGGCGGGGCAGGGGACAGAAGACGGCAAGGGGAAAGACACCGCGAGTGAGAGACGAGAGGCAGAGAACGACGACCGGAACGAAACACACGACACGCACACTAGCAGGACCAGCGGAGAGGAACAGAGGTCAGAAGGTGACGTGGGGCGAGAGGACACCGGGGCAGGGGCAGAGAAGGAACTGGAGGGCGAGCGGCTGAAGGTCGAGGAAGACGGGGGCAGTGAACACGAACAGATTGGACAGGACGGATACCAGCAACAGGACGAGACAGGGGATAAGAAAATAACAGATACGACAGGGGAGGAAGAAAAAATACAAGCAGTGGATGACCGGGGAGGAGGGACGGAAGAAGGGAAGGAAACCGGCCCGGGGACAGGGGGGGAAACCGGGGAGGTCGGGGAGCGGGGCGAGGGGTCGGCTTGA

Protein sequence:

>DPOGS215581-PA
MDMAEVKAFNAELSGLYENRPPISKAKMSAITRGAIKAIKFYKHVVHSVEKFIQKCKPEYKVPGLYVIDSIVRQSRHQFGQDKDVFAPRFAKNMQQTFANLFRCPDEDKRNIIRVLNLWQKNNVFGPEVIQPLLDLADPSHPLHLEIQNQNNTTNGSINMSHNTSDSKISPARQDSPQTSSPMGDAFQDDSSPGPQAKFNRKLLNDFEYESEDEQEPPPQPPHPPHPPHAPHPPHSSHATHTTHNPTDALGSILTNPEIMRQLQSLQAQMQLMTGMQIPNLMPMMSDMQLQQNQNSNAPFLNSQTEQQKPAEPKEDLANESDIEFVETGPQVIEIPDANDSRSPSPRRRHRSRSRSPRRRRRTRSPRRRRDKDRDRDRDKTHKEREAEKERQREREKRGLPPIKKENLSVCSTTLWVGRLSKQATPEELWDLFGAVGGVAAVDVVAPRGCAFVVMERRRDAARALAKLHRHKLHSKEIDVAWAAGKGVKGREWKDYWEAELGVAYLPWSALHARWLLGALSLDALEDGGAVDEDTLPPWLPPRILPKSVGEAVPLMGALPAPLPLPTGLPRLPPPGLGAPPPAAGYPGLGSLAPHQLLNESPGSSAPGLQRDPLLAFPPALPPHTMPQPGLVGGFLGGLMGVGVGHMNVGGLVLPLHPAHAHAAHSHAQVHTHAHPHAPPHVPPHALVPQVGQRAEVADDAMELDNDDQTDEPPAPAAPPAPAPAPALGLPPPAVPPPLSMDQLQVLLSKPPPTFNSAEPPPGFNPESFETEETPDERRERDKERRDRDRDRDRDRRDRRDDRPDRPGGRRERDRPRDRDRERDERRERDRGGRERRDRDRDRERDRFPRENNNEKSQKSPRSQAGEAGGAEKTLQERLWEMANGKTSDGDELEPRADRPPLIERPPLMERPQTADSKVRLRGPGGGGGPRPPPRAPWLAPRFNGLGPPFVRPPFERPPFEGPPMFERPPFGRMPFDGARPPFDGPRPPGPRLPFDAPRPFDGPRPPFDVPRPPFDVPRPPFEGQRPPFDGPRPLFDGPRPPFDGFEGDRSFDGPRFDGPPEFFDRGNRRFDDRDFNERGWNGDRDFDRRTEWEDRRRERRGRDNEERFRERGGRGRNYDERARPRDERNTRRDKDRKSRWGAADEAGQGTEDGKGKDTASERREAENDDRNETHDTHTSRTSGEEQRSEGDVGREDTGAGAEKELEGERLKVEEDGGSEHEQIGQDGYQQQDETGDKKITDTTGEEEKIQAVDDRGGGTEEGKETGPGTGGETGEVGERGEGSA-
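
The transcript and protein lengths in this record agree with each other: 3858 bp is 1286 codons, that is 1285 residues plus one stop codon, which the protein sequence writes as a trailing "-". The following is a minimal Python sketch of that check, not part of the original record. It assumes the standard genetic code applies to this nuclear transcript; the function name `translate` and the choice to render stop codons as "-" are illustrative only. It translates just the first 30 and last 12 nucleotides of DPOGS215581-TA rather than the full sequence.

```python
# Sketch only: assumes the standard genetic code (NCBI translation table 1).
# `translate` is a hypothetical helper written for this illustration.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string, writing stop codons as '-'."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(nt), 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append("-" if aa == "*" else aa)
    return "".join(residues)


# Lengths taken from the record above.
transcript_bp = 3858
protein_aa = 1285
codons = transcript_bp // 3              # 1286 codons in total
length_ok = (transcript_bp % 3 == 0) and (codons - 1 == protein_aa)

# First 30 and last 12 nucleotides of DPOGS215581-TA, copied from the record.
first_nt = "ATGGATATGGCGGAGGTCAAAGCGTTCAAC"
last_nt = "GGGTCGGCTTGA"

print(length_ok)             # True
print(translate(first_nt))   # MDMAEVKAFN
print(translate(last_nt))    # GSA-
```

The two translated fragments match the start (MDMAEVKAFN) and end (GSA-) of DPOGS215581-PA as listed above.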