Monarch geneset OGS2.0

DPOGS200520
Transcript        DPOGS200520-TA     891 bp
Protein           DPOGS200520-PA     296 aa
Genomic position  DPSCF300676 + 2613-5467
RNAseq coverage   1020x (Rank: top 12%)
Annotation
Heliconius        HMEL011923         1e-58    60.00%
Bombyx            BGIBMGA005410-TA   1e-88    67.19%
Drosophila        nos-PA             7e-15    51.67%
EBI UniRef50      UniRef50_A7L3A9    8e-93    60.95%   Nanos-like protein n=3 Tax=Obtectomera RepID=A7L3A9_BOMMO
NCBI RefSeq       NP_001098702.1     1e-93    60.95%   nanos-like protein [Bombyx mori]
NCBI nr blastp    gi|157412324       3e-92    60.95%   nanos-like protein [Bombyx mori]
NCBI nr blastx    gi|157412324       4e-96    60.63%   nanos-like protein [Bombyx mori]
Group
Gene Ontology     GO:0003723         1.5e-29  RNA binding
                  GO:0008270         1.5e-29  zinc ion binding
KEGG pathway
InterPro domain   [186-244] IPR008705   1.5e-29  Nanos/Xcat2
                  [192-246] IPR024161   8.8e-27  Zinc finger, nanos-type
Orthology group   MCL21007           Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200520-TA
ATGGACGAAGAGAAACCTTTATTCACAGCGTCGAGCGAGAATCAATGTATGACAAATAAAGAAAGTTTTACACAATCGGACAGAGCTATCAACGCTCCAGACAGTGCATTTTCACCGCTAGCTGCACCATTTCAGAGCAGAATGAACAACGGTCTCCTCAACATGATGCAACAAGAACCACGAGCCGCTGATACTGCGCCCTCCGCCGCCGATTTAGATGACGTCTTGGTGAATCTCCGCATTAATGGCAACGAGCCAATATTTGACGAAATGGATCAGAATTCGCAAAATTACTTTAATATGGGGGGGAGCAGAACTGATACGATGGCGGGATCTCATTCTAATATATGGGCTGAAGGATCTACACCTTCGGGGTCTAGGAATGGAGAGATGATTGACACCTTCGATTTCCTTATTAAGAGTCTCTCTGCGGCCAACAATTACGTGTCCATGTTGACTAGAGAGCAGTTATCTGTGCTCCGGTGTATAAGACCAAGTCTCTTGTACGAAATGTTGCAGGAGGTTGCTAAGGTCCGTAGTGACAAGCGAATGAGACGAGCTCTTCCTAACGAGTGTGCTTTTTGCAAAAACAACGGTGAGAACGAGGAATCGTACACATCGCACGCGTTGAAGGACTGGCGCGGACGTGTCGAGTGTCCCGTGCTGAGAGCGTTCCGCTGTCCACGGTGCGGCGCGACCGGGGACAGAGCTCACACGATCAAGTACTGCCCCGAGAACGGAGACGCCGGTATGGACCGTGGTGGTCTACTATCTCGTCGTCGCGCGCCTTCCGGTTTGCTTCTTGGTAGAGCTGGTTGCAGCACCCCGACACCCACACAGTCACCATCCGTCAATCAGCCGTCATTGTGGTCTAATTTCGGCATAAACTAA

Protein sequence:

>DPOGS200520-PA
MDEEKPLFTASSENQCMTNKESFTQSDRAINAPDSAFSPLAAPFQSRMNNGLLNMMQQEPRAADTAPSAADLDDVLVNLRINGNEPIFDEMDQNSQNYFNMGGSRTDTMAGSHSNIWAEGSTPSGSRNGEMIDTFDFLIKSLSAANNYVSMLTREQLSVLRCIRPSLLYEMLQEVAKVRSDKRMRRALPNECAFCKNNGENEESYTSHALKDWRGRVECPVLRAFRCPRCGATGDRAHTIKYCPENGDAGMDRGGLLSRRRAPSGLLLGRAGCSTPTPTQSPSVNQPSLWSNFGIN-
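
The transcript and protein lengths in this record (891 bp and 296 aa) are consistent with each other if the coding sequence carries one terminal stop codon: 3 x (296 + 1) = 891. The following is a minimal Python sketch of that check, assuming the standard genetic code and assuming the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `expected_cds_length` are hypothetical helpers written for illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is assumed here
    because the protein sequence in this record ends with "-".
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


def expected_cds_length(n_residues):
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


# First six codons and last four codons of DPOGS200520-TA.
print(translate("ATGGACGAAGAGAAACCT"))  # MDEEKP
print(translate("GGCATAAACTAA"))        # GIN-
print(expected_cds_length(296))         # 891
```

Both fragments match the start (MDEEKP) and end (GIN-) of the DPOGS200520-PA sequence above. Passing the full nucleotide sequence to `translate` would extend the same comparison to the whole record.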