Monarch geneset OGS2.0

DPOGS211425
TranscriptDPOGS211425-TA1071 bp
ProteinDPOGS211425-PA356 aa
Genomic positionDPSCF300115 + 512906-515040
RNAseq coverage44x (Rank: top 72%)
Annotation
HeliconiusHMEL0129531e-1424.32% 
BombyxBGIBMGA001263-TA1e-4740.15% 
DrosophilaCG7492-PA1e-1723.23% 
EBI UniRef50UniRef50_UPI00020613772e-9147.04%UPI0002061377 related cluster n=1 Tax=unknown RepID=UPI0002061377
NCBI RefSeqXP_001945109.13e-9147.19%PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|3287102259e-9147.04%PREDICTED: hypothetical protein LOC100573332 [Acyrthosiphon pisum]
NCBI nr blastxgi|3287102256e-9147.44%PREDICTED: hypothetical protein LOC100573332 [Acyrthosiphon pisum]
Group
Gene OntologyGO:00167888.6e-09hydrolase activity, acting on ester bonds
KEGG pathway 
InterPro domain[106-286] IPR0069128.6e-09Putative harbinger transposase-derived nuclease
Orthology groupMCL25002 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211425-TA
ATGCTCAATGAAGATTCTGACGAGTTTAAGAATTTTTGTAGAATGTCACCAAATGATTTTGATTTTTTGCTAAGCAAGGTGGAACCTTTAATTACCAAGCAAAAAACTAGACTCAGAGTACCCATCCCCGCAAAAGTGCGCTTAGCTCTAACTTTAAGATTCTTAGCTACAGGCGACAGCTACAGGAGTCTCCACCATCTATTCAAAATTTCTAGTGCAGCCATCACATTCATTATACAAGAAGTGTGCACGGCTATCAACACAGTCTTAAAGGATCAAATTAAGATGCCGCGCACTACTACAGAATGGTTGAACATAGAAAGTGGTTTCAGCAGGAAGTACCCACATTGTGTAGGTTGCATCGATGGCAAACATGTTGTGATACAATGTCCGATCAACAGTGGCACAGAAAAATATAAAGGAACATATAGTTTTGTGCTCTTGGCTTTAGTTGACAGCAATTACTGCTTTATATTTGCCGATATCGGGGCTCAGGACAGAATAAGCGACGGAGGAATATTTCAAAATAGTGTACTTTGGGAAAAAATTTCGACAGGGACTATTAATTTACCCCCTGATAGTCCACTTTCTGATGGACAGTGTAATATGCCACATGTTTTTCTTGGAGATGGGGCGTTTGCGTTGAGCAAACACGTGATGATACCGTTTCCAGGAAACCATGTCATGGGTTCGTTACAACGAACTTTTAACATGAGACTATCTAGTGCACGTGTTGTCGTTGAAAATGTTTTTGGACTCTTAACAACAGTCTTTAGGATATTTAAAAAGCCCATGGAAATCAAGAAAGATAAAGCTAAACTGATAACTATGACTTGTATTCTATTGCATAATTTTCTCAGAAACAGCAGAACGTCTAGAGACATTTATACACCTCGTGGGACCTTTGATACAGTTGTTGATGGCGAAATAATGAACGAAGGGTCTTGGAGACGAAATGTTTGTACTAATCAAGCTATAAGACCTATACCAGTTGTAGACAGCCGTGCGTCACAGAGCGCCATACTAATAAGAAATGAGTTTGCTAGTTTTTTTCTAAAGCAAAATTGTTAA

Protein sequence:

>DPOGS211425-PA
MLNEDSDEFKNFCRMSPNDFDFLLSKVEPLITKQKTRLRVPIPAKVRLALTLRFLATGDSYRSLHHLFKISSAAITFIIQEVCTAINTVLKDQIKMPRTTTEWLNIESGFSRKYPHCVGCIDGKHVVIQCPINSGTEKYKGTYSFVLLALVDSNYCFIFADIGAQDRISDGGIFQNSVLWEKISTGTINLPPDSPLSDGQCNMPHVFLGDGAFALSKHVMIPFPGNHVMGSLQRTFNMRLSSARVVVENVFGLLTTVFRIFKKPMEIKKDKAKLITMTCILLHNFLRNSRTSRDIYTPRGTFDTVVDGEIMNEGSWRRNVCTNQAIRPIPVVDSRASQSAILIRNEFASFFLKQNC-