Monarch geneset OGS2.0

DPOGS200518
Transcript        DPOGS200518-TA  1119 bp
Protein           DPOGS200518-PA  372 aa
Genomic position  DPSCF300450 + 138829-144112
RNAseq coverage   214x (Rank: top 45%)
Annotation
Source          Hit                     E-value  Identity  Description
Heliconius      HMEL017860              2e-98    54.87%
Bombyx          BGIBMGA001807-TA        7e-111   49.87%
Drosophila      tld-PA                  2e-09    25.22%
EBI UniRef50    UniRef50_UPI00019272A2  1e-11    30.09%    UPI00019272A2 related cluster n=1 Tax=unknown RepID=UPI00019272A2
NCBI RefSeq     XP_003101655.1          5e-13    27.27%    hypothetical protein CRE_11189 [Caenorhabditis remanei]
NCBI nr blastp  gi|308478890            9e-12    27.27%    hypothetical protein CRE_11189 [Caenorhabditis remanei]
NCBI nr blastx  gi|308478890            6e-12    27.27%    hypothetical protein CRE_11189 [Caenorhabditis remanei]
Group
Gene Ontology    GO:0006508           6.3e-23  proteolysis
                 GO:0004222           6.3e-23  metalloendopeptidase activity
KEGG pathway
InterPro domain  [159-364] IPR024079  4.6e-25  Metallopeptidase, catalytic domain
                 [163-361] IPR001506  6.3e-23  Peptidase M12A, astacin
Orthology group  MCL26469             Lepidoptera specific
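
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions the record does not state: that the transcript is coding sequence ending in a single stop codon, and that genomic and domain coordinates are 1-based and inclusive. The variable names are hypothetical.

```python
# Values copied from the record above.
transcript_bp = 1119
protein_aa = 372
genomic_start, genomic_end = 138829, 144112
domains = {"IPR024079": (159, 364), "IPR001506": (163, 361)}

# Assumption: the transcript is coding sequence only, ending in one stop codon.
coding_bp = protein_aa * 3 + 3

# Assumption: coordinates are 1-based and inclusive.
genomic_span = genomic_end - genomic_start + 1
domain_lengths = {acc: end - start + 1 for acc, (start, end) in domains.items()}

print(coding_bp == transcript_bp)  # True
print(genomic_span)                # 5284
print(domain_lengths)              # {'IPR024079': 206, 'IPR001506': 199}
```

Under these assumptions the transcript length matches the protein length exactly (372 codons plus a stop), and the genomic span is several times longer than the transcript, which would be consistent with a gene model containing introns. Both InterPro domains fall within the 372 aa protein.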

Nucleotide sequence:

>DPOGS200518-TA
GACTTCCACTCTCAAGAAATCATAATGACTACCCTGTCTCGTATCATGGACGAGCTCTGCGTAAAATTTTTCCAGAGTCCGATGAAGTACAACTCCTCAGATACGGATAAGATTTTGTACATCAGCAATCCGGGGAAGAGGAAGACGTGCGAAACTAATCGTTACAATTTTAACAAACCGGTCGTTGAAATGGCAATAGGTTACAAATGTTTAAACCCTCACGACATAACAGCCGCTGTGGTCGAAATGTTACGAGCAAGTATAGACCCTAAAGGTTTGACGGTAAACAGCTTCGACCTCATTAAGAAATTTAATGAAAAGGAATCTTCAAGTTCAATGTTGCTATCGGCCAGCGACAGGAATTACATCAATGCCCACTTCAAAACGGAGTGCGGTTCGATGTCCAAGCCTCTGTTGATCCAAAGACGTCTTGGCTCTTTGGATGTGGAAATGAGCAAAGCTAACGTAAAATACTACAAAGACAAAGTGTGGCCCCTGGGTATAGTTCTTTACGGGGTTGATGACGAACTAGGTAGTTCGCTGGATTACAAGATTCTGAGACACGCTATGACTTCCATTGAGATATCAACTTGTGTGGTGTTCCAAGAAGTCAAGACTGATGGTGTCTTGGAAGCTAAGAGTCAAATCTGGTTTTCGAGAGATGGTAACGAAATGCCCATGTTCGGTTTTGTTGAAAACAAACAGACAGTAAAACTGTCGTCCTTCGTTAATGGCGCCGCGGGCCACGACGCCCACGTTTACAACAACTTGTTCAGGGTTCTTGGAGTGCACATGATGTCGAATAGATTTGACAGAGATAATTACGTCACAATAACTTGGCGGAATGTTGAGAAGGGTAAAGAGCAGTATTTGGAACGTTCACCAGAAGAGGCTTGGCTAACGCAGATACCTTACGACTTCGACAGTATCACCCACGCTCCAGCCAATTATATGTGTGGAGACTGCTCGCTCGGAGCGGCCACCGTGCAACCCATACAGGATTACCTCTGGCAGAGAACTATATCGATGGGTCATTCGAAAGAATTAAGCAAATACGACGTACAAATTATCAATATGTTGTATATAACACAGTGTCGTAAACGATTATTTGACGTGTAA

Protein sequence:

>DPOGS200518-PA
DFHSQEIIMTTLSRIMDELCVKFFQSPMKYNSSDTDKILYISNPGKRKTCETNRYNFNKPVVEMAIGYKCLNPHDITAAVVEMLRASIDPKGLTVNSFDLIKKFNEKESSSSMLLSASDRNYINAHFKTECGSMSKPLLIQRRLGSLDVEMSKANVKYYKDKVWPLGIVLYGVDDELGSSLDYKILRHAMTSIEISTCVVFQEVKTDGVLEAKSQIWFSRDGNEMPMFGFVENKQTVKLSSFVNGAAGHDAHVYNNLFRVLGVHMMSNRFDRDNYVTITWRNVEKGKEQYLERSPEEAWLTQIPYDFDSITHAPANYMCGDCSLGAATVQPIQDYLWQRTISMGHSKELSKYDVQIINMLYITQCRKRLFDV-
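
The relationship between the two sequences above can be spot-checked by translating the transcript in frame 1. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein record marks the stop codon. The helper name `translate` and the two excerpt variables are hypothetical, and only the first 33 and last 24 nucleotides of DPOGS200518-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, stop_symbol="-"):
    """Translate a nucleotide string in frame 1, ignoring a trailing partial codon."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    codons = (nt[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


# Excerpts copied from the DPOGS200518-TA record above.
head = "GACTTCCACTCTCAAGAAATCATAATGACTACC"  # first 33 nt
tail = "CGTAAACGATTATTTGACGTGTAA"           # last 24 nt

print(translate(head))  # DFHSQEIIMTT
print(translate(tail))  # RKRLFDV-
```

Both excerpts reproduce the corresponding ends of DPOGS200518-PA. Note that the protein begins with D rather than M; the first ATG in this frame is the ninth codon.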