Monarch geneset OGS2.0

DPOGS214221
Transcript: DPOGS214221-TA, 1158 bp
Protein: DPOGS214221-PA, 385 aa
Genomic position: DPSCF300014 + 798804-802753
RNAseq coverage: 74x (Rank: top 66%)
Annotation
Heliconius: HMEL016411, 2e-40, 55.50%
Bombyx: BGIBMGA006197-TA, 1e-57, 42.65%
Drosophila: CG3294-PA, 1e-33, 32.02%
EBI UniRef50: UniRef50_UPI00015B41F8, 1e-56, 38.51%, UPI00015B41F8 related cluster n=1 Tax=unknown RepID=UPI00015B41F8
NCBI RefSeq: XP_001608048.1, 2e-57, 38.51%, PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|156537789, 4e-56, 38.51%, PREDICTED: U2 small nuclear ribonucleoprotein auxiliary factor 35 kDa subunit-related protein 2-like [Nasonia vitripennis]
NCBI nr blastx: gi|270006539, 2e-67, 42.99%, hypothetical protein TcasGA2_TC010403 [Tribolium castaneum]
Group
Gene Ontology: GO:0005634, 2.9e-30, nucleus
    GO:0003723, 2.9e-30, RNA binding
    GO:0000166, 2.4e-12, nucleotide binding
    GO:0003676, 2.9e-07, nucleic acid binding
KEGG pathway: uma:UM03893.1, 6e-18
    K12836 (U2AF1) maps to:
        Shigellosis
        Spliceosome
InterPro domain: [18-307] IPR009145, 2.9e-30, U2 auxiliary factor small subunit
    [248-312] IPR012677, 2.4e-12, Nucleotide-binding, alpha-beta plait
    [254-307] IPR000504, 2.9e-07, RNA recognition motif domain
Orthology group: MCL11241, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214221-TA
ATGTCCAATTTTCAGTGTCCCAATGTAAACCAGAAAAAACGGCAAGTTGTAATGTCGATGCCCCTAAAACCGCATAAAGAATGGAGAAGAATTGTGAAACGGGAAAGAAGAAGGAGGATACGCAAACAGTTGGCTAAACAAAGAGATTGCTTACCTAATGGAAATGATAATAAAGATTGGATAAAAGCACAAGAAGAATTAGAAACATATATTTTTGAACAAGTTGAAAATTCTAACAAGGTTGAAAATGAAAAGTGGTTGGCCGCCGAAGCAGTAGCTTTAAAACACTGGAAGGAACTTCAAATCAAAAAAGAAATGTTGCTTAAGAAGCAGCTAGAACTACAAGCTAAGCTTCATCAGGAATGGGAGATGGAAAAAGAGAGAAAAGAAAAGGAGGCTCAACGTCTTAAAGAATTAGAAGAAGAAAATGTCAAGAGACAGGAAGAGTTCATGAAAAACTTAGAAGAATTCCTAAATGGTGATTGTAAGGATCCTCCACAGGAGCTGCTTACATTGTATGAAAGTCGGCCAAATTGTGACCCATGTCCATTTTATGCTAAAACTGCATGTTGCCGGTTTGGAGATGAATGTTCTAGAAACCACAAGTATCCAGGTATCAGCAAGATACTCTTAGCTCCAAATCTTTTTGGACATTTTGGCTTGGAGAATTCAAATTTTAATGAATATGACACAGATATTATGTTGGAATATGAAGATAGTGATACTTACAAGGATTTCAAAGAATTTTTTTTTGACATATTGCCAGAATTTCAAAAATTTGGCCAGGTTGTTGAGATTAAAGTTTGCAATAACTTTGAAAAGCATCTCCGAGGCAACACATATATAGAGTACTCCGACGTTCGTAGTGCAGTTTCAGCTTACAGAGCCCTTCATACAAGATGGTATGGCGGAAAACAACTCTCATTGCAATTTTGTAGACTGTTATCATGGAGTAGTGCAATATGTGGCAACAACACACGTCGTCACGTTCATGGAGGTGGTCCGAATCGCCTGAAAGAGAAAGTCCAACTTCAAGAAGTAAAAGAAAAGACGATGGTCATTCCAAAAGTAGAGAAAGAAGACACTATCAACATCGATCACCTAGATCAAGATCACATCGATATAGGGATTAAGTTTATGTTGATCTTAATGAAATAA

Protein sequence:

>DPOGS214221-PA
MSNFQCPNVNQKKRQVVMSMPLKPHKEWRRIVKRERRRRIRKQLAKQRDCLPNGNDNKDWIKAQEELETYIFEQVENSNKVENEKWLAAEAVALKHWKELQIKKEMLLKKQLELQAKLHQEWEMEKERKEKEAQRLKELEEENVKRQEEFMKNLEEFLNGDCKDPPQELLTLYESRPNCDPCPFYAKTACCRFGDECSRNHKYPGISKILLAPNLFGHFGLENSNFNEYDTDIMLEYEDSDTYKDFKEFFFDILPEFQKFGQVVEIKVCNNFEKHLRGNTYIEYSDVRSAVSAYRALHTRWYGGKQLSLQFCRLLSWSSAICGNNTRRHVHGGGPNRLKEKVQLQEVKEKTMVIPKVEKEDTINIDHLDQDHIDIGIKFMLILMK-
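
Consistency check (illustrative sketch):

The transcript and protein entries above can be cross-checked against each other. The Python sketch below assumes the standard genetic code applies to this nuclear gene and that the trailing "-" in the protein sequence marks the stop codon. The names CODON_TABLE and translate are hypothetical helpers written for this illustration and are not part of the gene set or the database. Only the first ten and last six codons of DPOGS214221-TA are translated here, and the 1158 bp transcript length is compared with the 385 aa protein plus one stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_char="-"):
    """Translate a coding sequence, writing stop codons as stop_char."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_char if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 18 nucleotides of DPOGS214221-TA, copied from the record.
first_codons = "ATGTCCAATTTTCAGTGTCCCAATGTAAAC"
last_codons = "TTGATCTTAATGAAATAA"

print(translate(first_codons))  # start of DPOGS214221-PA
print(translate(last_codons))   # end of DPOGS214221-PA, including the stop

# Length check: 385 residues plus one stop codon should account for 1158 bp.
transcript_bp = 1158
protein_aa = 385
print(transcript_bp == 3 * (protein_aa + 1))
```

Passing the full nucleotide sequence to translate should reproduce the full protein sequence if the record is internally consistent.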