Monarch geneset OGS2.0

DPOGS206840
Transcript: DPOGS206840-TA, 1572 bp
Protein: DPOGS206840-PA, 523 aa
Genomic position: DPSCF300001, 3187124-3190019
RNAseq coverage: 274x (Rank: top 39%)
Annotation
Heliconius: HMEL013272, 0.0, 77.10%
Bombyx: BGIBMGA013072-TA, 0.0, 70.29%
Drosophila: CG5706-PA, 6e-24, 29.44%
EBI UniRef50: UniRef50_F4WQ34, 9e-132, 48.36%, Leucine-rich repeat-containing protein 47 n=8 Tax=Endopterygota RepID=F4WQ34_ACREC
NCBI RefSeq: XP_394881.3, 1e-131, 47.25%, PREDICTED: similar to leucine rich repeat containing 47 [Apis mellifera]
NCBI nr blastp: gi|380013257, 7e-140, 49.32%, PREDICTED: LOW QUALITY PROTEIN: leucine-rich repeat-containing protein 47-like [Apis florea]
NCBI nr blastx: gi|328783055, 6e-140, 49.51%, PREDICTED: leucine-rich repeat-containing protein 47-like [Apis mellifera]
Group
KEGG pathway: bfo:BRAFLDRAFT_124565, 4e-25
 K01890 (FARSB, pheT) maps-> Aminoacyl-tRNA biosynthesis
Orthology group: MCL14638, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206840-TA
ATGAGTGTCTGGCCAGAAGTAGCGACGGCAAAATCCGAAAACAGGCATGAAATCAAATTGGCGGGTGCTGCTATATCTAAACGCATTTCGGAGGAAGGTTTAGACAAGACAGTCTTTAAATTAACAAATATTAATCTACTAAACATTAGCGATACATGTCTTCCATCTATTCCCGATGAAATCAAATCACTTGTTAACCTCCAATCGCTATTGTTATACGGGAATAAGTTAACAGAATTTAATGAAAACATAACCTCTTTGAAAAAACTTAAGGTGCTCGATTTATCTAGAAATCAAATAACAAGCATTCCGGATAGTTTAAACAACATGAAAGAGCTGTCCAGCATAAATTTTAGTTCAAATAAAATCGATCACATGCCAGCCTTTGGTGATTTTCCTAACTTAATCTCGATAGATCTATCGAACAATAAATTAACGGACTTTTTGAATACCGAACAGGCAAACCTGCCTCATTTAACAGACTTAAAGATTAAGGGAAACGAAATAGAAACACTTCCAGGTTATATAGCAAGTACCATGCCTTCACTAAAAAATTTCGATATTGGGGAAAATAAATTGAAAACACTCCCTGGAGAAATAGCTGGAATGGCAAAGTTGAAAGAATTGAATCTAAAAGGTAACAAGCTTTCTGATAAGCGTCTCATGAAGCTAGTAGACCAGTGTCGCACAAAACAAATTGTCGACTACATTAGAGAGCATTGTCCTACATCAGACAGTAACCAAGCTACAAGCAAGGGTAAAGGGAAGAAAGCCAAGAAACAAGATGAAATCCTACCAGACAATGCATCTGAACTCAGTCACACTCTGAAAATTATGCACCTAGACGATGATACATTGAAAATAAAAATTATAGAACAAGAAGTCTGGAACATTCGCCCCTACATACTCAGCTGTATAGTGTATGGGTTGAACTTTGATGAGGCACTCTTCAAGAAATTCCTGCAAATGCAAAATAAATTACATGACACTGTGTGTGACAAACGGAATGTAGCCACGCTGGCTACGCATGACATGAGTAAAATACCTCCAGGTGACCTTGTATACACAGCCAAAACACCATCGGAATTAAAACTGATTCCTTTAAATCGAACAAAACATTTTACGGGTGAACAATTGTTCCAGCAATTGACAAACGAAGCGGATGCACTGAGGAAGGAAAAGAAAAGAAATGTTTATTCAGGAATACATAAATACCTGTACTTACTTGAAGGCAAACCCAAATATCCATGCTTAGAAGATGCAACAAAGAGAGTGATCAGCTTTGCCCCAATCACCAACTCTGAGGAAACTAAAATGACGGTAGACAGCAAATCAATGTTAGTGGAAGTGACATCGCATTCATCGCTCGGTGCTTGCAAAACTGTCATGGATAAACTCCTACAAGAATGTTTGATGCTTGGCATTGGGGAAGGAGACGGTGACTTCCACACATTGACTGTGCAACAGGTCAAAATTGTGGATCCCGAGGGTAATCTGAAGAGCATCTACCCATCAAGAACAGACTGTGTTTACGATAGCACCATCAAAGTTTACAGGATTCCCAAGAAATAA

Protein sequence:

>DPOGS206840-PA
MSVWPEVATAKSENRHEIKLAGAAISKRISEEGLDKTVFKLTNINLLNISDTCLPSIPDEIKSLVNLQSLLLYGNKLTEFNENITSLKKLKVLDLSRNQITSIPDSLNNMKELSSINFSSNKIDHMPAFGDFPNLISIDLSNNKLTDFLNTEQANLPHLTDLKIKGNEIETLPGYIASTMPSLKNFDIGENKLKTLPGEIAGMAKLKELNLKGNKLSDKRLMKLVDQCRTKQIVDYIREHCPTSDSNQATSKGKGKKAKKQDEILPDNASELSHTLKIMHLDDDTLKIKIIEQEVWNIRPYILSCIVYGLNFDEALFKKFLQMQNKLHDTVCDKRNVATLATHDMSKIPPGDLVYTAKTPSELKLIPLNRTKHFTGEQLFQQLTNEADALRKEKKRNVYSGIHKYLYLLEGKPKYPCLEDATKRVISFAPITNSEETKMTVDSKSMLVEVTSHSSLGACKTVMDKLLQECLMLGIGEGDGDFHTLTVQQVKIVDPEGNLKSIYPSRTDCVYDSTIKVYRIPKK-
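
The transcript and protein above are consistent in length: 523 residues plus one stop codon account for 3 x (523 + 1) = 1572 bp. A minimal sketch of how the two sequences could be cross-checked follows. It assumes the standard genetic code and that the trailing "-" in the protein marks the stop codon; the names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` so the output matches the
    notation used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)
```

Applied to the first 18 bases of DPOGS206840-TA (ATGAGTGTCTGGCCAGAA), `translate` returns MSVWPE, the first six residues of DPOGS206840-PA. Applied to the last 15 bases (ATTCCCAAGAAATAA), it returns IPKK-, matching the end of the protein.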