Monarch geneset OGS2.0

DPOGS203122
Transcript        DPOGS203122-TA  2214 bp
Protein           DPOGS203122-PA  737 aa
Genomic position  DPSCF300094 + 161124-193207
RNAseq coverage   640x (Rank: top 20%)
Annotation
Heliconius      HMEL016042        7e-158  72.73%
Bombyx          BGIBMGA001447-TA  3e-74   74.87%
Drosophila      Su(Tpl)-PB        3e-38   45.76%
EBI UniRef50    UniRef50_Q7PMS0   1e-38   30.43%  AGAP004795-PA n=4 Tax=Culicidae RepID=Q7PMS0_ANOGA
NCBI RefSeq     XP_318017.4       2e-39   30.43%  AGAP004795-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|380020092      2e-41   45.85%  PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II elongation factor ELL2-like [Apis florea]
NCBI nr blastx  gi|189242036      1e-82   34.29%  PREDICTED: similar to AGAP004795-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0006368  3.2e-68  transcription elongation from RNA polymerase II promoter
                 GO:0008023  3.2e-68  transcription elongation factor complex
KEGG pathway     mdo:100014041  2e-09
                 K06088 (OCLN) maps->
                     Pathogenic Escherichia coli infection
                     Leukocyte transendothelial migration
                     Tight junction
                     Cell adhesion molecules (CAMs)
InterPro domain  [87-420]   IPR019464  3.2e-68  RNA polymerase II elongation factor ELL
                 [602-702]  IPR010844  5.6e-28  Occludin/RNA polymerase II elongation factor, ELL domain
Orthology group  MCL18977  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203122-TA
ATGGCGGCCTTGCCAGCAGGTGTTCAGTATGCGTTATCATCGGAGGCTAGTTATAAAGAAAACAAGGAACTAGTGTTTGTGAAATTAACGGATTCAGCACTAAAAGCTATAGAAGACTTCATTAGAAATAATAGGGGTCCGGGGTCGTCGCACCCTCTTCCCTCAAATCTAACCACCTGGTTGTCCCTGTCCAACAGATGTCAGTCTCCCGAGGAATTGCGTTACTATAAAGAGGTTTCCTGCGTGGATATAGACAAATTGGCAAAACCTAAAATAGAGTTCCTACCCGGAAATCAAGGGAAAATTTCCATTCCGACTCCAAGTAGTAGCAATGGGACATCGTCGGAGTCAACGTTTCGTTTCAGCATCAACAGCAATGCGGAAATGGAGGGTCCACAGGGTTCATTCGAGTGCGTTAGGAGCGGTGGCGCCAAACGCCTGGAATCGTGCGGGCCGCTGCCGAGACGGATGCGAGTACAGGCCAACGACGACTCATACGAGGCCACCAAGGACCGTATGTCCAGGACCATAGCGGCCGAACAGAGCAAATGCACCCGCGTCATTAAACCAAATCAAACGGATATCGGGAGACGCGTCAAGGTACGCTCCTGTGGGTTCTCGAATGGACCAGCGGCTCAGCTCGCCGCGCGGTTAGAGAGGGCTGAGCGTCCTGAGCGTCCTGAGCGTCCTGAGCGTGCTGAGCGTCCTGAGCGACGACTCGCTGGACTTGCGGGACTCGCTGGACTTGCGGGACTCGCACCCGAGCGACCTGAACGCCCTGAGCGACAGGATCGTCCCGAACGTCAGGAATGGCCCGAAAGACCCGAGCGACAGGAACGTCCCGAGAGGCCCGAGAGGCCTGAACGGCAAGAGAGGCCGGAGAGAGTAGAAAGGGTTGAGAGACCGGCGCCGGCGCCCCGGCCGCAGCCCCCCGCCCCGGTCGCCGCGCCCCCGCACACACAGCCGCAGCGACCACCCCCCAATCCGGACCTCACCAGGCGGCCTCTTAAAGAAAGACTTATACAATTATTAGCACTGAAACCCTTCAAGAAGCCAGAACTATACGCAAGACTCATAAGCGAGGGTATCAAAGAAAAGGAACGCAGTATGGTTAACAAGATACTGCCAGAGATCGGAACGCTCAAGGACAATTGTTACCACTTACGTAGACATATATGGAACGATGTTAACGAAGATTGGCCTTTCTACACCGAAGAAGAGAAACATATGCTGAAGAGGCGGAAACCTCAAAATCTAACACCGCCTTTGAGCAGCGACTCCGCCAATTCTCTGTCACCTCGCGCGTCCCCCGGCAAGCGGCCGTCGGTCCCAGGAGACGAGCTGCCGGCGAAGAAACAGCGCATCTCTCACTACAGACGACCTTCACCACCCTCCTCAGGGTACGCCACCACCTCCTCCGGCGAGCGACACGCCTCTGACAATGAGGACGACCGCACCGCGAGCGAGGAGGCGCCTGTAAAGAAGCAGCGCGTGTCTCAATCAAGACGGTCTTCACCACCCTCCTCAGGGTACGCGACCACCTCCTCCGGCGAGCGACAGGCCTCTGACAATGAGGACGAACGGAACGTTAAAAAGGACAATGGGTATACTCTCAACTTCACAACCGTTAAAGATCTCTGTCCAAGTCCTGTTAAAACGAATGGTTTCAGTAGAAGTAGTCCTCCTGTAGAGCAAACGAGTATCACAGTAAAAGACATCACAACGGAACCATTAGAAAATACAGCTTTAACGTCCGTGCCTGAAGAAAATAATACGACTGATCTGGTAGACATTGAAAGGCAATATCCTCCAATAACAAGTTCCAGTACCCGTCGCGCGTACAAGAACGAGTTTGCGAATCTGTACACGGAGTATCAATCGTTGTACGGTCGCGTGGCACAGGTGGCAGCACTGTTCACACAGTTAGAACAACAACTCAAAAGAGCCGAGCCTAAGAGCCCACACCATAGGAGCATAGAACAACGTATTGTAGAGGAGTA
TCATCGTATGCGTAACGACGCCGACTATCAACGTGAGAGGCGGCGCGTCAATTATTTACATCGTAAACTAAACCACATCAAGAGAATGGTGCATCAGTACGACCAGCTGAACACATATGACATCAAACGCAAACTCGTTGTACAGACGCTGTTGTTTATACGTAACCTGAAACCGGAGCGCGTGTCAGCTGCCAGTACTACACAGGCGTACTGA

Protein sequence:

>DPOGS203122-PA
MAALPAGVQYALSSEASYKENKELVFVKLTDSALKAIEDFIRNNRGPGSSHPLPSNLTTWLSLSNRCQSPEELRYYKEVSCVDIDKLAKPKIEFLPGNQGKISIPTPSSSNGTSSESTFRFSINSNAEMEGPQGSFECVRSGGAKRLESCGPLPRRMRVQANDDSYEATKDRMSRTIAAEQSKCTRVIKPNQTDIGRRVKVRSCGFSNGPAAQLAARLERAERPERPERPERAERPERRLAGLAGLAGLAGLAPERPERPERQDRPERQEWPERPERQERPERPERPERQERPERVERVERPAPAPRPQPPAPVAAPPHTQPQRPPPNPDLTRRPLKERLIQLLALKPFKKPELYARLISEGIKEKERSMVNKILPEIGTLKDNCYHLRRHIWNDVNEDWPFYTEEEKHMLKRRKPQNLTPPLSSDSANSLSPRASPGKRPSVPGDELPAKKQRISHYRRPSPPSSGYATTSSGERHASDNEDDRTASEEAPVKKQRVSQSRRSSPPSSGYATTSSGERQASDNEDERNVKKDNGYTLNFTTVKDLCPSPVKTNGFSRSSPPVEQTSITVKDITTEPLENTALTSVPEENNTTDLVDIERQYPPITSSSTRRAYKNEFANLYTEYQSLYGRVAQVAALFTQLEQQLKRAEPKSPHHRSIEQRIVEEYHRMRNDADYQRERRRVNYLHRKLNHIKRMVHQYDQLNTYDIKRKLVVQTLLFIRNLKPERVSAASTTQAY-
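
The transcript and protein lengths reported in the header can be cross-checked against each other: a complete coding sequence should be three nucleotides per residue plus one stop codon, and 3 x (737 + 1) = 2214 agrees with the 2214 bp listed for DPOGS203122-TA. The sketch below shows one way such a check might be scripted. It assumes that the transcript record is a complete CDS with no UTRs and that a trailing "-" or "*" in the protein record marks the stop. The function names parse_fasta and check_cds and the TOY-TA/TOY-PA records are hypothetical, chosen for illustration only; the toy sequence reuses the first four codons of the transcript above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_fasta(text):
    """Return {record_id: sequence}, joining sequence lines that were wrapped."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def check_cds(cds, protein):
    """Run basic consistency checks between a CDS and its protein.

    Assumption: the CDS has no UTRs, and a trailing '-' or '*' on the
    protein is a stop marker rather than a residue.
    """
    cds = cds.upper()
    residues = protein.rstrip("-*")
    return {
        "multiple_of_three": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue plus one stop codon.
        "length_matches": len(cds) == 3 * (len(residues) + 1),
    }


# Toy records (hypothetical IDs); the sequence line is wrapped on purpose.
toy = """>TOY-TA
ATGGCGGCC
TTGTGA
>TOY-PA
MAAL-
"""

records = parse_fasta(toy)
report = check_cds(records["TOY-TA"], records["TOY-PA"])
print(report)
```

Because parse_fasta joins wrapped lines, a sequence that is broken across several lines, as the nucleotide record above is, would be read as a single record.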