Monarch geneset OGS2.0

DPOGS212002
TranscriptDPOGS212002-TA2322 bp
ProteinDPOGS212002-PA773 aa
Genomic positionDPSCF300136 - 341122-345785
RNAseq coverage194x (Rank: top 48%)
Annotation
HeliconiusHMEL0058562e-9252.39% 
BombyxBGIBMGA004517-TA4e-9651.08% 
DrosophilaCG14814-PB5e-4045.83% 
EBI UniRef50UniRef50_E0VV376e-5153.30%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VV37_PEDHC
NCBI RefSeqXP_001603730.16e-5353.76%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3071696997e-5254.01%Acidic repeat-containing protein [Camponotus floridanus]
NCBI nr blastxgi|2420190541e-4941.12%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00054881e-19binding
GO:00055151.2e-05protein binding
KEGG pathway 
InterPro domain[584-699] IPR0066401.1e-24Domain of unknown function SprT-like
[351-484] IPR0119901e-19Tetratricopeptide-like helical
[437-469] IPR0131052.4e-07Tetratricopeptide TPR2
Orthology groupMCL25028 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212002-TA
ATGGATAACGCGAAATCTCAAAATAATCCGCGGAATCGGTTTTTAAAATTGTCTTTGAAGAAAAATGGGGTAAAACAAATATCAACACAAAAGAATATGCGTTCTAAAGCTGATGTTATAATTCTTGATGATTCGTTTGATAAATTACGGTTGGAATTTCCTAACGCTTCATTCAATGCTACTTTAGAATTTACATCGCCAGCTAACCCCAAAGAGAATAGACCAAGTATAGGAGGTTGGTCACCAGCTCAGAATATTGTGTGTGCAACACCAAAGAATGACCCTCTACCCTCAACACCTCAAGACAAAGCACGTAAATCTTCAGAACAGTGTTCCATAAAAAAGATACATCAGTCACAACAGAAACTTCTTAGCGATTTATATGGGGAAACATGGAAATCAATACCCTCACTCTTCAAAACCATACAAGAAAATCACAAAAATTTCAACGGGGTATCAAAGAAGTTGAATTTCAATGATGATGACAAAGAGAATATTAGAAGCGATTTGGAAAGGAACAAACAGTTGTATTTGACTGAATCGGAAACCAAAAGACGAGTAGACTTGTTGAATAGTGATAAGAAATCAAAGAAGAAGCTTTTCACAGAAAAAGTACCGAGTACTCCAGAACCTCCTAAACAGAAATCCAAAATTAATGTCAATACAACATCCAAGAAGAAAAAGTCAATGACAGTTACCGAGCTTGTTGAAGTGATGACAAACGATGTTAACTCATTGTCAGAAAAAGTGAAAAATGTTTCCGTTACACCAAAAAATGATGTCATTAATAGGTGTAGTTTTGTTGGGTCACTTGCAGACAATATACCCTCGTGGCGTTGTCACACTGAAGCCATGCAGTACAGAGACAACTATAAATCACTCAAAGAAAAACTGGCCAGGAGATTGTTTGTGGAGTTTAATAAAAATGTATTTGACAATGCCCTCGATGCAGATATGCCTATAATTTGGGACACAAAACTAAGAAGCACTGCCGGGTTCAACGACCTACCGATGGAGGAACAGAGAAGGTTCCAGGTGACAATGGAGACAGTGAAATCAATACATTCCCAGGCCAAGGAGCTGTACTCCAAGAAGAAATACCTAAAGGCCATAAGAAACTACCAGCAATCAGTGTCGATTCTGAACATATCCAACACTCGAGACGAAGAGGAAGAGAACAATGTGAAAGATTTAAAAATTAAAGCCTACATCAACCTCGCTGTCTGTTATTACAAACTCAACAAACCCAAATACGTCCTCAACATGTGCGAAAGCCTAGACTATCTCACGGACACGGACAGACACAGCAAAATATTGTTCTACTACGCCCGAGCTTACGAAATGCTCAACAAATACGACCAAGCCGTGACGTATTACAAGAAAGCACTAAAAATCGAACCGCACAACAAAGAGATAGGCGACGCGTTGACAAAGTTAGATGAATACAATAAAAAATCAGCTGTCACAGAGAAAGAAATATGGCGGAACGCGTTCAAAACGGATATAGATAAGAAGGTACGATATAGCGTCGATCGAGATTTTCAAAATGGCGTCCTTGACATGTGTCAAGAATTGGCGGGAAAGGTCGAGTACTCGAGGTTTGATTTACCAAGCGGCTTGACGAAGGATGAGCTCGAATGTATCAAGGACTTATGTTCAAAATTCGAAGGACTGGTTGTCATGGAAACCGGAGAGGGGAAGAAAAAAAACGTGTCCATCGTAAAGAGTACAACTACAAATAAGCTGATAAAAACGTCGAAAGGCACAAAAATTAGAACATCAAGTATAAAGTTATCGAATAAGGTTGTTGATAATTGTCAAAGATTAAGGGACACTCTCATACACGAACTATGTCACGCTGCGACATGGCTGATAGATGGGGAATTGAGGGCAGGGCACGGACCGCTGTGGAAGAAATGGGCAACCCGTTCTTTAAGGAAGTATCCGGAGTTGGGGGAAATATCGAGATGTCATGACTTGGAGATACATTATAAATACTGTTACAAGTGTACACAATGCGGTTATAGTATCAAACGTCATTCCAAATCCATTGACATAACGAAAAAATGTTGCGGCTACTGTCGCGGGACTTTTGAGATTATCATAAACAAGAAAAACAAAGATGGCGTCGTTGTTTCAACTCCGGCGAGGAAAGGCGGCCCGAACGACTTTGCGCTTTTCGTCAAAGAAAATTATGGATCACATAAGAAAAATGGTAAAACTCACGCTGAGGTCATGAGAGTGTTGGGCGAAGAGTTTTCGGCGCGTAAAAATAGAATGAATGACCGAGTATATGACGATTTAGAATCTTGCAGCGATTAG

Protein sequence:

>DPOGS212002-PA
MDNAKSQNNPRNRFLKLSLKKNGVKQISTQKNMRSKADVIILDDSFDKLRLEFPNASFNATLEFTSPANPKENRPSIGGWSPAQNIVCATPKNDPLPSTPQDKARKSSEQCSIKKIHQSQQKLLSDLYGETWKSIPSLFKTIQENHKNFNGVSKKLNFNDDDKENIRSDLERNKQLYLTESETKRRVDLLNSDKKSKKKLFTEKVPSTPEPPKQKSKINVNTTSKKKKSMTVTELVEVMTNDVNSLSEKVKNVSVTPKNDVINRCSFVGSLADNIPSWRCHTEAMQYRDNYKSLKEKLARRLFVEFNKNVFDNALDADMPIIWDTKLRSTAGFNDLPMEEQRRFQVTMETVKSIHSQAKELYSKKKYLKAIRNYQQSVSILNISNTRDEEEENNVKDLKIKAYINLAVCYYKLNKPKYVLNMCESLDYLTDTDRHSKILFYYARAYEMLNKYDQAVTYYKKALKIEPHNKEIGDALTKLDEYNKKSAVTEKEIWRNAFKTDIDKKVRYSVDRDFQNGVLDMCQELAGKVEYSRFDLPSGLTKDELECIKDLCSKFEGLVVMETGEGKKKNVSIVKSTTTNKLIKTSKGTKIRTSSIKLSNKVVDNCQRLRDTLIHELCHAATWLIDGELRAGHGPLWKKWATRSLRKYPELGEISRCHDLEIHYKYCYKCTQCGYSIKRHSKSIDITKKCCGYCRGTFEIIINKKNKDGVVVSTPARKGGPNDFALFVKENYGSHKKNGKTHAEVMRVLGEEFSARKNRMNDRVYDDLESCSD-