Monarch geneset OGS2.0

DPOGS212209
TranscriptDPOGS212209-TA1506 bp
ProteinDPOGS212209-PA501 aa
Genomic positionDPSCF300323 + 121020-124949
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0068439e-7063.89% 
BombyxBGIBMGA001155-TA1e-8861.04% 
Drosophila% 
EBI UniRef50UniRef50_D1ZZT56e-1141.38%Putative uncharacterized protein GLEAN_08103 n=1 Tax=Tribolium castaneum RepID=D1ZZT5_TRICA
NCBI RefSeqXP_974936.11e-1141.38%PREDICTED: similar to BRAF35/HDAC2 complex [Tribolium castaneum]
NCBI nr blastpgi|910809812e-1041.38%PREDICTED: similar to BRAF35/HDAC2 complex [Tribolium castaneum]
NCBI nr blastxgi|3838536384e-1825.06%PREDICTED: uncharacterized protein LOC100876648 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[338-382] IPR0110112.4e-06Zinc finger, FYVE/PHD-type
Orthology groupMCL26769 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212209-TA
ATGGAAGTTTCTAACGAAATAAAGGAAGATATTCGTAAAACACAAACGGATTTGAAAAATGCAATACGCATTCATCAGATATGGGTGACTCGATTGGAAGAAGATGAGAATAATATTCAACTTAAGTTTAAAGTAAATGAAGCTGAAAAAGAAATTATAGCCATAAGTCAGGCTCAAAAATTGGTAGTGGATAGACTCCGCAAGGAGTTAGAATTATATCAACAAAGACTGAGAGCCAGAAATAAACAAGTCAGTATAGAAAATGACAATAAATATGTAGCTCAGCAACTCAGGGATCACCAGTTACAATATCGCAATAGAAACCGTAATATATCACTACTAAAGCCATCGGTACTCAATGAAATACATATCAAGACGGAGCACAATGATAATAACAACTCGGATTCCGATAACGAAAATAAATACAGCGAGAAGGAGAACGCTAATAGCAGTGAAACAGATGAAAAGGAAAAATACTCACAAAATTATAATCAGATTCCCAATTTAGTATTGGGAGGCGGCAGGAATAATTTAACTCATGCCTTGAACAAAGTCAAAGAATCCTTAATTGAAAATAACATACAAGAGGAATGTCAATGGGAGGAACAAAGATCAGACTCGGATTCAACATTTGATCTGTCTCCGACACCGTCCCCTCCACCTCTACCTGAGCCGGGAAAACCCATCATTAAGGAAGTGTTCATGAGGTTGATCGGCCTGATAACTCCGGCTCAGAAAGAAATTATTGAGAGGAATAGGAACGAACGTCGCAAGAGATCCACAAATAATTCCAACAAGAATGACTTTCTATATGGAAGCTATGAAATGATGCCTAAACGTAAAAAATTAAATCAATATCCGTATCTACAAAGTCACAGTGACCCGCCTCAGACGAGATCCGCTAAATTACGTCAACAGCAGAACCAAAACAAATCATCGAGGGAGGGCTCCCCTTCCGGTTCATGCTCCGAAGTTAAAGGTTGGGGTAACAACAAGCCAGCGTGGTTGGCTTCTCTTCCAGCTGGTTTGTCCGTGGAACCAGTTTACTCTCCCACCAAGAAAGTATGTCACGGGTGTGGAAGGAACGATGTTCCATCTCTTCTGGTGTGGTGTTCGTCCTGTACTGTATGGCTGCATACAGGGTGTTCAACGAGTGGTCGATGTAGCTGTGGATCTACGCTGCCAGACCCAGCTGAGTGCAGCGTTAATTCCAATAGCGACGATATATATAGAGATAAATTAGCGGAAAGGAAGAGGCTCCAAGAAAAGAACATAGAGCTCTGTAAGGAACTGCGCAAGTTAGAAGCGAGGGCAGCGACCTTGAAGGAAAATTTGGACGAACATAACGCTGAGAAACGACAGCTGTTGGCGGATCAGATAAAAACACAGAAGAATCTACAGAAACTTCTAGACTTTATAAGCCAGTTTAAAGAGACCTCTATAAGCATACGTTCAACATCAGTGAGTGAGTCGGGCAGTGAAGTTAGTAAAAGTAATGAAGATTGA

Protein sequence:

>DPOGS212209-PA
MEVSNEIKEDIRKTQTDLKNAIRIHQIWVTRLEEDENNIQLKFKVNEAEKEIIAISQAQKLVVDRLRKELELYQQRLRARNKQVSIENDNKYVAQQLRDHQLQYRNRNRNISLLKPSVLNEIHIKTEHNDNNNSDSDNENKYSEKENANSSETDEKEKYSQNYNQIPNLVLGGGRNNLTHALNKVKESLIENNIQEECQWEEQRSDSDSTFDLSPTPSPPPLPEPGKPIIKEVFMRLIGLITPAQKEIIERNRNERRKRSTNNSNKNDFLYGSYEMMPKRKKLNQYPYLQSHSDPPQTRSAKLRQQQNQNKSSREGSPSGSCSEVKGWGNNKPAWLASLPAGLSVEPVYSPTKKVCHGCGRNDVPSLLVWCSSCTVWLHTGCSTSGRCSCGSTLPDPAECSVNSNSDDIYRDKLAERKRLQEKNIELCKELRKLEARAATLKENLDEHNAEKRQLLADQIKTQKNLQKLLDFISQFKETSISIRSTSVSESGSEVSKSNED-