Monarch geneset OGS2.0

DPOGS208703
TranscriptDPOGS208703-TA1416 bp
ProteinDPOGS208703-PA471 aa
Genomic positionDPSCF300043 - 318492-322046
RNAseq coverage1297x (Rank: top 10%)
Annotation
HeliconiusHMEL0152429e-12068.97% 
BombyxBGIBMGA003348-TA3e-10163.11% 
DrosophilaFip1-PA2e-3155.17% 
EBI UniRef50UniRef50_D6WJB81e-3838.70%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJB8_TRICA
NCBI RefSeqXP_001607096.11e-4136.60%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565464113e-4036.60%PREDICTED: hypothetical protein LOC100123452 [Nasonia vitripennis]
NCBI nr blastxgi|1565464114e-7337.36%PREDICTED: hypothetical protein LOC100123452 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[169-211] IPR0078543e-23Pre-mRNA polyadenylation factor Fip1
Orthology groupMCL17093 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208703-TA
ATGGCAGACGCTGCTGTAGAAACCGCTCCAGCAGATGAAAATGATGATAATTGGTTATATGGAGAATCAGCGAGTGAACAAATCGAAGCGGAAGCATCAACTACTGATGAAAAAACACAAGAACAGGAAAATGCTAATAAGGAGAGAATAGATGATGATGGTAAAGTACAAGAAGAGACAGAGACAAATGAGGGTAACAATGAAGACTCTCACTTCAACGATGAACACTTTGGTGAGGTGGACAGAGATGATCAGGATCAGACGAATGGAGACGCCGACAGTCAGGACAACGGGGACACCGACTCTGATGATAGTGATGACGTCAAAGTAACGATTGGAGAAATTAAGTCAGGACCACAAGCTTATGCCAGTTTGAATATAAAACGTGGTGTTGGACTTGTAGCAGCTGGAACTGAGAAGCCTCGTCAAGGTCCAGCGGCCGGTAACAAAGTGACTCTTGAAGACTTAGATGGTCCCGGAAGCATCAACGGCGTGCCAGCGCTAGAGTTTAATATTGACACTATTGAAGATAAACCCTGGAACAAACCTGGAGCTGACATATCTGATTACTTCAATTACGGTTTCAACGAGGTGACCTGGAGCGCTTACTGTGAGCGTCAGAGACGGATGCGTGTTAGTGAGGCCGGTGTCGCTCTACACGCCGCCCCGCCGCCCCGCGCCGCCCCCACAGACAGACGGCAACAAGGTCCACCAAGACATGACGACATGCCGCCAGGGATGCCAAATAATTACCAGTCCAGAGAGAACACTATACAGGTGATGACAGCCGAGCGTCGTGAGTACGGCCGCGGTCAGGTGCGCGAGGCTGCACCACCCGCCGACTACTTCAGCGCTCCGCCCCCCGACCACTACTACCAGCCTCCGCCTCACGCGCCTCACGCTCCACACCTCCCTCCACACCAGCACACACCACACTCATACGAAGAACCCTGGGCTCATCCAGAACAGACAGGCTGGGCGCCGTCAGATATAAAGGAACTAACGCCCGGACCCATGGGACCGCCGATGCCGCTAGGCATGCCCCCCGTACACATGCCCGCGCCCTACCCGACATACCGCTCGCACGTCACACACACACACGAAAGAGACCGGGACCGGGAACGGGAAAGAGACCGAGACCGGGACCGGGACCGGGACAGGACCCGCGACGACCGTGACCGACGGGACCGGGACCGGAGGGATGAGGAGGAAAGAGACAGGGATCGTGAACGTTCACGCTCCATTAAACCGGAGAGGATACGAGAGAAATCGTACCGCCGTGAGCGGTCTCGTTCACGTTCCCGCCGTCACAAGTCCCGGTCCCGGTCTCCGAGACAACGGGAACGTTCCCGGGACAGGGAGAGGAGTATGAAGCCCAAGAACAAGGATGCCAAGGAAAAGGACGAAGATAAATAA

Protein sequence:

>DPOGS208703-PA
MADAAVETAPADENDDNWLYGESASEQIEAEASTTDEKTQEQENANKERIDDDGKVQEETETNEGNNEDSHFNDEHFGEVDRDDQDQTNGDADSQDNGDTDSDDSDDVKVTIGEIKSGPQAYASLNIKRGVGLVAAGTEKPRQGPAAGNKVTLEDLDGPGSINGVPALEFNIDTIEDKPWNKPGADISDYFNYGFNEVTWSAYCERQRRMRVSEAGVALHAAPPPRAAPTDRRQQGPPRHDDMPPGMPNNYQSRENTIQVMTAERREYGRGQVREAAPPADYFSAPPPDHYYQPPPHAPHAPHLPPHQHTPHSYEEPWAHPEQTGWAPSDIKELTPGPMGPPMPLGMPPVHMPAPYPTYRSHVTHTHERDRDRERERDRDRDRDRDRTRDDRDRRDRDRRDEEERDRDRERSRSIKPERIREKSYRRERSRSRSRRHKSRSRSPRQRERSRDRERSMKPKNKDAKEKDEDK-