Monarch geneset OGS2.0

DPOGS203564
Transcript: DPOGS203564-TA; 2136 bp
Protein: DPOGS203564-PA; 711 aa
Genomic position: DPSCF300055 + 599976-602503
RNAseq coverage: 4x (Rank: top 89%)
Annotation
Heliconius: no hit listed
Bombyx: BGIBMGA008558-TA; 7e-49; 44.44%
Drosophila: Rcd1-PB; 2e-18; 27.56%
EBI UniRef50: UniRef50_Q9P2N6; 7e-21; 27.95%; Uncharacterized protein KIAA1310 n=81 Tax=Eumetazoa RepID=K1310_HUMAN
NCBI RefSeq: XP_395713.2; 2e-20; 26.82%; PREDICTED: similar to CG8233-PC, isoform C, partial [Apis mellifera]
NCBI nr blastp: gi|348530440; 1e-21; 27.17%; PREDICTED: uncharacterized protein KIAA1310 homolog [Oreochromis niloticus]
NCBI nr blastx: gi|71417943; 3e-26; 32.63%; hypothetical protein [Trypanosoma cruzi strain CL Brener]
Group
KEGG pathway:
Orthology group: MCL21804; Lepidoptera specific

Nucleotide sequence:

>DPOGS203564-TA
ATGGCGATCGATCCGAGCTGTCTTCACGAACGGCTCGTGGAACACGTGAGAGCGGTCAGCGGCGAGCCTAACAGTCAAACCGGCGTTTGGGGCGAACACTCTTACGCGCGGCCCAGAGGCGTCGCCCCCGACCCGCTGATACGAACGCTGCTCGCGCCGCGGCCTTCCTCGCCCGACGAAGGAGACGTCTTGGATGTCGAGGGAAGTCCGCCCTCACCCCCCGGACTGCCCTTGGACGTGGATGACTCACGAGACTCTGAACAGGACGACCACGACGGGGACGACGAGGACTGGGAGAAACGAGTGGCGGCGCTCGCGCCCACGGCGGGGCACGCGCGGCTCGCGCGAGATGCGGCCGTCGCGCTCCGGGGGCTGCGACTGGAGCGGCTTGCGGGCCGGGGAGGACGGTGGGACAGGACGGATGGGGCGCGGGCCGCAACCAGGAGACTGAGACGAGCGCTCGCCTCGCACTGGACGAGCGGCGCCGCGGCCTGGCTCCATTCCACGATGACCACGAGTCTGCCTCGGTCGCTGAGGGCCCACTACGACGAGGTGCTGGCGGAGCTGTGGCGGACGGTGCCTCGCCTCGCAGAGCGCCTGGCCGCACCCCGACCCCTCGTAATCCAAAACGACCCTCTCGCCGTCGTCGGAGAGAGGCGGCCGGCGAGCGAACCCGGCCCGTGGCTAGCGTGGGCGCCGAGCGGCAGCGAGACGGAGGATGCGCGCTGGGTGAGGCGCCTCGGGGCCTTGATCCACGTGCGGGAGCTGGTCCCAGCCGCGCCGCACACGCCGGGCCTCGCCCCGGACCGGTGGTGCGCGACCCTGGCGCACGAGGCGCGAACCGCCCTCACCGAGCTGCTCTCGGAGGCCGGACGGAGGCCCGTGTTGCTGGGCGGGGCGGGCGCTGGCGCGGCTCTGGGGTCGTGGCTGGCGACGGGCGGGGCCGGCGCTCGTGTTCGCGGACTGGTCCTGCTGGCTCCGCCCTTGCTGACGGCCGAGGGTCCCCGAGACGCGGCGGAGGAGCCCGCGGACGAGCCGGACCTGCCCCTGCTGTGCGTCTCGGGGTCGGCGGGGGCGTCGTGCTGGCGGAGCGCGGCGGCCGAGCTGTGTCGCGGCGCTCCGCGGGGCGCGTGTCGCCGGGTGCTGGTGGTGTCGGGGGCGGACGACCGTCTCCGGCTGCCTCGCGCGTGTCGCCGTCGTCTCGGAGTGCCGCAGGAGGCGCTGGACGCGGCCGTGTCGGAGGAGTGCGCGCGGTGGTCGCTGGACGTGGCGGAATCGTCGCCGAAGGAGCGCTCCAGACGGAAGGAACTAAATATGAATAAATACAAAGCTCACATGTTCGTATTTTTCCAGTCGGCGAGGAGCACCGCGGAACACAGTGAGTATGTCTGTGTGTCACACACCGAGTCCTGGCCGGGCGTCGAGCTCGCTCGCCTCGGGGCCGGCGGTCTCACATACTCCTACAGACGTTCTATTCCCGACGTCTCTCTTAACGCCAGGGCGTATAAAACAATCGTTTCAATGATCGAACTCCGCCTGAGTAAAGCTATTAGCAAATGGACCATGTTACTTCCGCCAAGTGTCTCTACCACGAGTCAGTCTGATGTTTGGTCCCTGCATCGATTTTTTGGAATAAAATACACGTCGACCATCAACACTGGTTCTGTATCCAAAGCGCACAGTTCCTTCCGCGACTGTAGGCCACCGGCTCCCCCCCAGCTGTATTCCCTGCCCGCCACGTATCGTCTCACTCCTGGAACCGCTCATCTAATCCGAATCGTTTTCATGTCACAGGTCAAGCCGCTCCATCGAGATAAGGGTCGCGTGGTGCAGCGCGGGACCGGAGGCGTGGCTCTGGAGCTGCAGCCGCCGCGCCGCGCCAAGAGGTCGGAGAGGTCCGAGCAGGCGGGTGCGGAGGCGGGCGGCGAGGAGTCGCTGGCGGCCGCCGACATCATGCAGCTACCCATCGTGTTCGCTGACGACGAGCCCCCCGCGCCTCC
CGCCGCCTCCGCCGCCCCCCTCGCCCTGACCGTGACGAGCGGAGCCCCGCGCGCCGTGCGGTACACGCGAGTCATCGTGGCCAAGAAGACGGCCCGCCGCCGCCACAGGCCGCCGCGCCCGGCCCTCGACCACTGA

Protein sequence:

>DPOGS203564-PA
MAIDPSCLHERLVEHVRAVSGEPNSQTGVWGEHSYARPRGVAPDPLIRTLLAPRPSSPDEGDVLDVEGSPPSPPGLPLDVDDSRDSEQDDHDGDDEDWEKRVAALAPTAGHARLARDAAVALRGLRLERLAGRGGRWDRTDGARAATRRLRRALASHWTSGAAAWLHSTMTTSLPRSLRAHYDEVLAELWRTVPRLAERLAAPRPLVIQNDPLAVVGERRPASEPGPWLAWAPSGSETEDARWVRRLGALIHVRELVPAAPHTPGLAPDRWCATLAHEARTALTELLSEAGRRPVLLGGAGAGAALGSWLATGGAGARVRGLVLLAPPLLTAEGPRDAAEEPADEPDLPLLCVSGSAGASCWRSAAAELCRGAPRGACRRVLVVSGADDRLRLPRACRRRLGVPQEALDAAVSEECARWSLDVAESSPKERSRRKELNMNKYKAHMFVFFQSARSTAEHSEYVCVSHTESWPGVELARLGAGGLTYSYRRSIPDVSLNARAYKTIVSMIELRLSKAISKWTMLLPPSVSTTSQSDVWSLHRFFGIKYTSTINTGSVSKAHSSFRDCRPPAPPQLYSLPATYRLTPGTAHLIRIVFMSQVKPLHRDKGRVVQRGTGGVALELQPPRRAKRSERSEQAGAEAGGEESLAAADIMQLPIVFADDEPPAPPAASAAPLALTVTSGAPRAVRYTRVIVAKKTARRRHRPPRPALDH-
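
The lengths and coordinates listed in this record can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes that the transcript is a complete coding sequence (one codon per residue plus one stop codon) and that the genomic coordinates are inclusive. The function name `cds_is_consistent` is hypothetical and not part of any MonarchBase tooling.

```python
# Values copied from the record above.
transcript_bp = 2136
protein_aa = 711
genomic_start, genomic_end = 599976, 602503


def cds_is_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals 3 * (residues + 1 stop codon).

    Assumption: the transcript contains only the coding sequence, with no UTRs.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Assumption: start and end coordinates are both inclusive.
genomic_span = genomic_end - genomic_start + 1

print(cds_is_consistent(transcript_bp, protein_aa))  # True
print(genomic_span)                                  # 2528
print(genomic_span - transcript_bp)                  # 392
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is 392 bp longer than the transcript, which would be consistent with intronic sequence in the gene model.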