Monarch geneset OGS2.0

DPOGS214010
TranscriptDPOGS214010-TA1317 bp
ProteinDPOGS214010-PA438 aa
Genomic positionDPSCF300313 + 13897-16091
RNAseq coverage75x (Rank: top 65%)
Annotation
HeliconiusHMEL0107183e-7947.40% 
BombyxBGIBMGA011735-TA1e-4937.94% 
DrosophilaCG10948-PC6e-3533.33% 
EBI UniRef50UniRef50_UPI00020605ED8e-4131.48%UPI00020605ED related cluster n=1 Tax=unknown RepID=UPI00020605ED
NCBI RefSeqXP_001948785.11e-4131.48%PREDICTED: similar to GA10660-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3286988443e-4031.48%PREDICTED: ecto-NOX disulfide-thiol exchanger 1-like [Acyrthosiphon pisum]
NCBI nr blastxgi|3071681289e-5332.18%Ecto-NOX disulfide-thiol exchanger 2 [Camponotus floridanus]
Group
Gene OntologyGO:00001663.9e-11nucleotide binding
GO:00036761.4e-08nucleic acid binding
KEGG pathway 
InterPro domain[211-332] IPR0126773.9e-11Nucleotide-binding, alpha-beta plait
[252-297] IPR0005041.4e-08RNA recognition motif domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214010-TA
ATGAACACATGGGCACAAGAAAATTACCAGATGGGGATGTCTCCAATAATGCCACCATCTATATTACCACCACCTATGATGATGCAGGAAATGAACGTCCCTCCTCTGAACCCACCGCCTATTCCTCTACTTACATCTATTACGACCGAAGACATGAATATCGTGACTGACGATATGAGTGATACAAATATAACATGCCATGATGTCTCTGGTAATGAAACACGAAACCATCGTTCTAGGAGCCGTGATCGTTCCTTACGACGTGACAAAGATCATCAAGATCGAAGACATCGATCCAGATCGAGGGATTGCCATGATAGATATAATAGGAACGACAAGAGGAATGTAGAAAGAAATGATAGAACCAGGAATGTTGAAAGGGAACGGAAGACAAAATGGGATAATAATAGGGTACCAAAGAATAATGTGCAGCAGATTCATGCTGGAATTAACATGGGTATGAATATGGGCATGATTCCAGGTATGAACATGATATCTTTCCAAAATATGCTCCCGAACATGATAGGTCAGCAACCACTGGATGGCACGCTGATGCCACAGCATATAATGCCAAATATCATGATGGCCAACATGATGCCAAACATGTTAGACCAAAATATAATGATGATGAACCAACAAAACATGATGCCGCTGCCCAACCAACAGATATATCTTAATAATGGAGTAATGCTACCTCCGATCCCTGGAACTGTTACCCCGGAACGACGGGAAAGGCCCAAAGGTTGTCGCACAATATTTGTTGGGGGCTTACCATTAAATGTAACAAATGATACATTGATGGAAATGTTCCAAAAATTTGGTAGTATAGAGGATATTAAATCGCCTAAAAGCGGTGTTTATTATATCCGTTTTGAAAGACCTGAGAGTGTGGCGCCATCTTTCTTCTTAACGGGATATAGATTTAAATTTCATGATCAAATTGAAAATGAAGCTACAACGATCTTTGTTGATTATGCCTTGAATCGCGATGATCAGAACGAATATGAGCGTAGACAACGCCACAGAGAGAAGACCCCGCCGCGTGTGGAGCCCTTCTCACAGACAGCGCTCACAAATCTATCTGAAAAGATCAAAAGTGAAACTGAATTTGCCACCGCTGCTCCTACTCTAGCTGTGTGGCTCGAGCGCGGGGAGTGCAGTAAGAGGCACGCGAATGCCTTCTACTCACTAATACAAGCGAGCAACAATCAAATAAGGAGACTTTTCACAGAGAAGATGCAACTAGACGATGACTTCCAAAATATGAAGAACGCTATCAAGGAGAAATTCGCTCACGTTGTCATGCAGTGTGAGTAA

Protein sequence:

>DPOGS214010-PA
MNTWAQENYQMGMSPIMPPSILPPPMMMQEMNVPPLNPPPIPLLTSITTEDMNIVTDDMSDTNITCHDVSGNETRNHRSRSRDRSLRRDKDHQDRRHRSRSRDCHDRYNRNDKRNVERNDRTRNVERERKTKWDNNRVPKNNVQQIHAGINMGMNMGMIPGMNMISFQNMLPNMIGQQPLDGTLMPQHIMPNIMMANMMPNMLDQNIMMMNQQNMMPLPNQQIYLNNGVMLPPIPGTVTPERRERPKGCRTIFVGGLPLNVTNDTLMEMFQKFGSIEDIKSPKSGVYYIRFERPESVAPSFFLTGYRFKFHDQIENEATTIFVDYALNRDDQNEYERRQRHREKTPPRVEPFSQTALTNLSEKIKSETEFATAAPTLAVWLERGECSKRHANAFYSLIQASNNQIRRLFTEKMQLDDDFQNMKNAIKEKFAHVVMQCE-