Monarch geneset OGS2.0

DPOGS204455
Transcript: DPOGS204455-TA; 1293 bp
Protein: DPOGS204455-PA; 430 aa
Genomic position: DPSCF300002 + 331340-339750
RNAseq coverage: 567x (Rank: top 22%)
Annotation
Heliconius: HMEL008154; 6e-92; 81.82%
Bombyx: BGIBMGA013473-TA; 6e-84; 78.26%
Drosophila: glo-PA; 1e-32; 39.78%
EBI UniRef50: UniRef50_G4VLD5; 4e-49; 31.31%; Putative heterogeneous nuclear ribonucleoprotein (Hnrnp) n=4 Tax=Schistosoma mansoni RepID=G4VLD5_SCHMA
NCBI RefSeq: XP_002581068.1; 7e-50; 31.31%; heterogeneous nuclear ribonucleoprotein [Schistosoma mansoni]
NCBI nr blastp: gi|322801457; 2e-70; 40.14%; hypothetical protein SINV_07549 [Solenopsis invicta]
NCBI nr blastx: gi|91094511; 2e-77; 41.49%; PREDICTED: similar to Heterogeneous nuclear ribonucleoprotein H (hnRNP H) [Tribolium castaneum]
Group
Gene Ontology: GO:0000166; 4.2e-09; nucleotide binding
Gene Ontology: GO:0003676; 1.6e-08; nucleic acid binding
KEGG pathway 
InterPro domain: [341-417] IPR012677; 4.2e-09; Nucleotide-binding, alpha-beta plait
InterPro domain: [120-184] IPR000504; 1.6e-08; RNA recognition motif domain
Orthology group: MCL10373; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204455-TA
ATGTTGGGATCCGGTGATGGTAGCTATATTATAAAACTACGGGGCTTGCCTTTCTCCACTACTGCTGAAGATGTCCTCACATTTCTAAGCGGCGTAAATGTTATAAACGATAAAGAAGGTGTCCATTTGACCGAGGTTAGACCTGGTAGGCCTTCTGGAGAATGTTTCGTAGAGGTGCAGAGTCAAGACGATGTCACTCACGCGTTGAAAAAAGACAAAGAAAATATTGGCAGACGGTACATTGAAGTCTTTTCCACCGATCGTCAGGACATGGAGTGGGCGTTAAATGCTATGAGACAGAGTGAAAATGGTTTTGATGTTATACCCAATGTGTCTGATGATTTTGGAATAGTAAAATTGAGGGGCTTGCCTTTTGGTTGCTCTAAAGAGGAAATTATACAATTCTTCAACGGGTTGTCGGTGGTCCAGGACGGGGTGCACTTGCTCTCGGATCACACGGGGCGGGCCTCCGGCGAGGCGTTTGTATACTTCGTAGACAAGCAGAGCGCTCGAGACGCTCTTGACAGGGACATGGAGAAAATAGGACACAGAATGCTACATGCTAACGCTGTGATGACATCAAAGGACTCGTTGTGTTTAGTTGATGTTATCACAGTCTGTTATGAATATTGGCATTCTGGAGTGCATTGGTTGTTGAGTTGTCAAGGTCGAGGGTCGAGGGGCGCGGGCGGAGGGTCGCGGAAGACGGCCGCCGGTTTATATGAGATGCACCAGTTGTATGAGGCTGGAGATAGGGGATTCATGCGATATGATATGGATGTTTTAAAACATATTTTGACCGTACTTGTGCCATACATAGAGGTGTTCCTAAGTTCAGCGGACAAGGTGCGAGCGTACGGCGCTCGGCTGGAGGGTGGCGGGTTCAAGTCCCGGGGATACAGACCCACGCCCTACGACAGGAACGACAGGCTCTCCGGCCGGTTCGGGGGCAGGGGGAGGGGGTCCTTCGCCAGAGGAGGTTCGGGAGGCTCGTACACGGGCCGCGGCGGCCGGCTGGGCTCCAGCCACTGTGTACACATGAGGGGGCTGCCGTTCAAGGCGACGCCGCAGGACATCGCATATTTCTTCAAGCCGATCCGTCCTCTGAATATCAACATACACTACGACAACAGCGGCCGACCCTCGGGGGAGGCGGACGTGGAGTTCGAGTGTCACGAGGACGCCATGAGGGCGATGCGCCGTGACAAGAACAACATGGAACACCGCTACATCGAGTTGTTCATGAACTCCTCGCCGACCTTCAAGAGCCCGCGCCCGTTCCGCACCTACTGA

Protein sequence:

>DPOGS204455-PA
MLGSGDGSYIIKLRGLPFSTTAEDVLTFLSGVNVINDKEGVHLTEVRPGRPSGECFVEVQSQDDVTHALKKDKENIGRRYIEVFSTDRQDMEWALNAMRQSENGFDVIPNVSDDFGIVKLRGLPFGCSKEEIIQFFNGLSVVQDGVHLLSDHTGRASGEAFVYFVDKQSARDALDRDMEKIGHRMLHANAVMTSKDSLCLVDVITVCYEYWHSGVHWLLSCQGRGSRGAGGGSRKTAAGLYEMHQLYEAGDRGFMRYDMDVLKHILTVLVPYIEVFLSSADKVRAYGARLEGGGFKSRGYRPTPYDRNDRLSGRFGGRGRGSFARGGSGGSYTGRGGRLGSSHCVHMRGLPFKATPQDIAYFFKPIRPLNINIHYDNSGRPSGEADVEFECHEDAMRAMRRDKNNMEHRYIELFMNSSPTFKSPRPFRTY-