Monarch geneset OGS2.0

DPOGS204243
TranscriptDPOGS204243-TA1326 bp
ProteinDPOGS204243-PA441 aa
Genomic positionDPSCF300046 - 490980-492305
RNAseq coverage753x (Rank: top 17%)
Annotation
HeliconiusHMEL0151720.089.40% 
BombyxBGIBMGA007518-TA0.085.93% 
DrosophilaTBPH-PC4e-11062.89% 
EBI UniRef50UniRef50_B0X5065e-9765.20%TAR DNA-binding protein 43 n=1 Tax=Culex quinquefasciatus RepID=B0X506_CULQU
NCBI RefSeqXP_001602436.13e-11757.83%PREDICTED: similar to TBPH CG10327-PB [Nasonia vitripennis]
NCBI nr blastpgi|2700119605e-11953.33%hypothetical protein TcasGA2_TC006055 [Tribolium castaneum]
NCBI nr blastxgi|3454904535e-12653.73%PREDICTED: TAR DNA-binding protein 43-like [Nasonia vitripennis]
Group
Gene OntologyGO:00001667.3e-19nucleotide binding
GO:00036761.4e-16nucleic acid binding
KEGG pathwayrcu:RCOM_06556102e-18 
 K12741 (HNRNPA1_3)maps-> Spliceosome
InterPro domain[120-196] IPR0126777.3e-19Nucleotide-binding, alpha-beta plait
[120-190] IPR0005041.4e-16RNA recognition motif domain
Orthology groupMCL11120 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204243-TA
ATGAGTAATAAAAGTAATTATAACGAAGACCTTTTCGAATACATTAAAGTTAGTGAAGACGAAGATGATTCCAACGCAGTTGAGATACCTTGCGAACATGATGGCACTATGTTGTTGTCAACACTTGTGGCACAATTTCCTGGCGCGTGCGGATTAAAATATCGTCATCCTGATAGTAAAGCTACTCGCGGTATTCGTTTAAGCGATGGTAAGCTTCATCCACCTAGTGATGCCGGCTGGGGTAAACATTTGTACATTTGTGTATTTCCAAAAGAAAACAAACGAAAGATGGAAGACGTTTCGCCGGAAAATTCAGCGGCTAAAACAAAACGCTTAGAAAAAAAACTCACTTGTTCGGATTTGATTTGTCTTGGTTTACCATGGAAAACAACTGAAGAATCTATAAAACAATACTTTGAACAGTTTGGTGAAGTTGTTATGGTTCAGTTAAAACGTGACAAAAACGGCTCTTTTAAAGGATTTGGATTTATACGTTTTGCAACTTACGCCTCACAAATGAGAGCTCTTGCTCAGAGGCACAATATTGACGGGCGCTGGGTTGATGTTCGCATTCCCAATTCTAAAGAAGGGGTAGTTCCACAAATGCCGTGTAAAGTGTTTGTAGGCAGATGTACAGAGGATATGACGGCAGACGATTTGAGGGATTATTTTTCTCGATTTGGTGAAGTAACAGACGTGTTTGTACCAAGACCTTTCAGAGCATTTGGTTTTGTTACCTTCTTAGATCCTGAAGTAGCTCAAAGTCTGTGTGGAGAGGATCATGTTATCAAAGGTGCATCAGTTTCAGTATCAAGTGCTGCACCAAAAATTAAAAGTAAATCCAATCCTAACTGGAAAGATGACTCTTATGGTCCAAGCAACTGGGAAGGAGGTCGATCTGGGTCTTCTGGTTCCGGTGGTAATGGAAGTAATAATAACATTGATACATTAAATATGCAGAACTTAGGTATTAATCCTAATGGTGGACCGCCAGCTAATTTCAATTTGCCAATAAGTCTTGTTGCTGCTGCACTTAATCAAGCTGGTTGGGGTGGATTCTTAGGAGGACCAGGAAATTCAAATAATTGGCAAGGACCTCCTCAAGGTAATTCAGGCAAAGGTTGGAACAATAATCAAAATTGGGGTCAAAATGGTCCACCAGCCTGGAGTCAGAGCAGTAATCCTGGGTGGTCTGGAAGTAAGAGCGGGAACTGGAATCAAGGAAATTGGCCTAATGGACAAAATTGGGGAGGTAATGGTGGAGGTGGCTCAGCTCCCTCAGGATCTAGCAGCGGGTGGAATAATAACAACAAGCCCCAGACATGA

Protein sequence:

>DPOGS204243-PA
MSNKSNYNEDLFEYIKVSEDEDDSNAVEIPCEHDGTMLLSTLVAQFPGACGLKYRHPDSKATRGIRLSDGKLHPPSDAGWGKHLYICVFPKENKRKMEDVSPENSAAKTKRLEKKLTCSDLICLGLPWKTTEESIKQYFEQFGEVVMVQLKRDKNGSFKGFGFIRFATYASQMRALAQRHNIDGRWVDVRIPNSKEGVVPQMPCKVFVGRCTEDMTADDLRDYFSRFGEVTDVFVPRPFRAFGFVTFLDPEVAQSLCGEDHVIKGASVSVSSAAPKIKSKSNPNWKDDSYGPSNWEGGRSGSSGSGGNGSNNNIDTLNMQNLGINPNGGPPANFNLPISLVAAALNQAGWGGFLGGPGNSNNWQGPPQGNSGKGWNNNQNWGQNGPPAWSQSSNPGWSGSKSGNWNQGNWPNGQNWGGNGGGGSAPSGSSSGWNNNNKPQT-