Monarch geneset OGS2.0

DPOGS212376
Transcript: DPOGS212376-TA, 1902 bp
Protein: DPOGS212376-PA, 633 aa
Genomic position: DPSCF300019 + 330033-331934
RNAseq coverage: 278x (Rank: top 39%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL005665        3e-130   59.52%
Bombyx          BGIBMGA004642-TA  5e-85    77.68%
Drosophila      -                 -        -
EBI UniRef50    UniRef50_E2C5Z6   4e-27    57.89%    Uncharacterized protein C19orf43 n=17 Tax=Coelomata RepID=E2C5Z6_HARSA
NCBI RefSeq     XP_001603041.1    4e-28    63.46%    PREDICTED: similar to ENSANGP00000013532 [Nasonia vitripennis]
NCBI nr blastp  gi|307194224      1e-26    57.89%    Uncharacterized protein C19orf43 [Harpegnathos saltator]
NCBI nr blastx  gi|307171852      2e-39    40.36%    Uncharacterized protein C19orf43 [Camponotus floridanus]
Group
KEGG pathway:
Orthology group: MCL34937 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212376-TA
ATGTCCTCGTACCGCGGTGATGCGGATTCCTGGGACGGCGGGCCGCCAGGCGCCGAGGCGGACTACGCACCGCCGCCTGGCCCACCGCTCGCTGCTCCCGTGCCCACCACCGACCCCTGGACGGGGGCCAGCTACTCTCATTACGGCGCTGCTCAATACGATTACCACTCTTACAGTGCTCCCTCTTACGGTTATGGTTACGATTCCGCTTATTACCAGCCCGATGGGGGTTATTACAACTCTAGTAGTGCTTATCCATCTCAAGGCCAGAGCGGTTATGAATACAGCAAGACGAGGTCTCGGCCTTCTCCTGAAAGTCGTGGCTACGATCGCCGACGTCCGTCCTACAAATCAAAGTCCCGAAGCGTATCGCCCTACGATAGACGGGAAAGAGAGTATTCGAAGAAAAGAGACAGTTATGAGAATATGAGACGGCGAAGATCACGTAGTACCTCCAAACCTAAACGTTCAAGGTATTCGTCGAGTGACCGGTCAAGCTACCGGTCTAGATCATACTCTAGGTCACCTGGAAAAGATATGAAATCAAGAAAGTCGAGATCATCAGAATCGGCCAGATCCGAAAGATCTTTTAAAAAAGCTGGAACGAGTAGAAAGGATCGTTCATTGACTCCTCCAATGCGCAATAAAACTTCACAGTCCCGGAACAGGTCTAAGTCTTATGAAAACGGTAAATGTCGGTCTCCAAAATATAAGTATGACAAGGAATCTAAAGGTAAAAAGAACTCTGTGCCCAGAAAAGACACCAAGACTAAAGACAGAAGAGAGCGTGAAATTTTCACGCCGCCTAGAAATAAATCCACGGACTCTTTAAAAATAAGAAAAAAATCTGATAGATATTCCAAAAGCCCACCTGTAAATAAAAAGTCTGAACGATCTCCACAAAAAAAATCAAGTTCCAAAACAAAAGTTCAACAGAAAAAATCGAGAGACAGCTCCGTGACTCCCCCTCGGAGTTATGCAAGAGAAGCTACTCCTAGTTCGAGATCGCATTCCGAATCTCTAACACCAAGTAGCAAAAGAAAGCGATCTCTCACACCGAAATCATTTCCCAGCAGATCTTCCTCAAGCCATTCGTCTGATTCTTCTGTTAGGTCACGTTCAAAACGTCGTACGAGACGTAAATCCAAACCCGGCTCAAGAAGTCGCTGCGGGTCACGGGAACGATCTCTAAGTAAAAAAAGATCCCACTCTCGCGGTCGGTCTAGTTCTCGGTATAGGTCGAGAAGATCAAGAAGTTCTAAGACGTCTAAGTCTAGATCCAGATCTCGGAGCTCCCGGTCGCGCAGTCGCAATCGTAGTCGTAGTCGCAGTGGCACTGCAAGTGAAGATGAACGCAGGGGTCAATTTACAGTGGCAGACAGAAAAAGGTTTTGGAAAATGCACAGAAGTAGACAAATGGAGCGGCTGAAATCTCCACCTAAGGATGTAAGACCGCCGCCCGGTGCGGTCGAATCTACGGCACTGGACGATATCGAGTATGGAGTCCCACCGGAAGTGGAAGGCCCGAATTTCGCTGAACTTTTACCAACACCAGAACAAGTAATGCAATTACCGTCAGCCTCGAAGTCCAAACCAGTACCGATACCAATAAAGAATGATGGAAGTTTTCTGGAAATGTTTAAGAAGATGCAAGAAGAAACTAAGAAGATTGAAGCTACTGAAACAAAACCTGCCATCAAGAAACCTGTTTTACCATTTATAGGGAAAAGAAGAGGGGGGAGAGTACTTAAAACAGGATTGGTGAAAAAAGCGAAAGCTATAGATGAACAAACAGTAGACAACACGCCCAAAGACGCTTGGTCTCTTTATATGCAAGAAGTTAAGAAATATAGGGAAACTTCGTGTCAGGAGGAGAGAAAGACTAGGCCTCTCGTCAAATGA

Protein sequence:

>DPOGS212376-PA
MSSYRGDADSWDGGPPGAEADYAPPPGPPLAAPVPTTDPWTGASYSHYGAAQYDYHSYSAPSYGYGYDSAYYQPDGGYYNSSSAYPSQGQSGYEYSKTRSRPSPESRGYDRRRPSYKSKSRSVSPYDRREREYSKKRDSYENMRRRRSRSTSKPKRSRYSSSDRSSYRSRSYSRSPGKDMKSRKSRSSESARSERSFKKAGTSRKDRSLTPPMRNKTSQSRNRSKSYENGKCRSPKYKYDKESKGKKNSVPRKDTKTKDRREREIFTPPRNKSTDSLKIRKKSDRYSKSPPVNKKSERSPQKKSSSKTKVQQKKSRDSSVTPPRSYAREATPSSRSHSESLTPSSKRKRSLTPKSFPSRSSSSHSSDSSVRSRSKRRTRRKSKPGSRSRCGSRERSLSKKRSHSRGRSSSRYRSRRSRSSKTSKSRSRSRSSRSRSRNRSRSRSGTASEDERRGQFTVADRKRFWKMHRSRQMERLKSPPKDVRPPPGAVESTALDDIEYGVPPEVEGPNFAELLPTPEQVMQLPSASKSKPVPIPIKNDGSFLEMFKKMQEETKKIEATETKPAIKKPVLPFIGKRRGGRVLKTGLVKKAKAIDEQTVDNTPKDAWSLYMQEVKKYRETSCQEERKTRPLVK-
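
Consistency check (illustrative):

The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and not part of the original record. It assumes the genomic coordinates are 1-based and inclusive, and that the trailing "-" in the protein sequence marks a single stop codon. The function names are hypothetical.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus one stop codon if requested."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values as stated in the record above.
transcript_bp = 1902
protein_aa = 633
start, end = 330033, 331934

# 633 codons plus one stop codon should account for the whole transcript.
print(expected_cds_length(protein_aa) == transcript_bp)

# The genomic span should match the transcript length if the model has no introns.
print(span_length(start, end) == transcript_bp)
```

Under these assumptions both checks agree with the stated figures: 3 x (633 + 1) = 1902, and 331934 - 330033 + 1 = 1902. This would be consistent with a single-exon gene model, although the record itself does not state the exon structure.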