Monarch geneset OGS2.0

DPOGS206539
Transcript        DPOGS206539-TA   1869 bp
Protein           DPOGS206539-PA   622 aa
Genomic position  DPSCF300190 - 117448-121262
RNAseq coverage   78x (Rank: top 65%)
Annotation
Heliconius        HMEL011634         0.0      71.34%
Bombyx            BGIBMGA005909-TA   5e-154   62.05%
Drosophila        Ank2-PU            6e-26    29.69%
EBI UniRef50      UniRef50_E0VRA7    4e-86    35.58%   Ankyrin repeat-containing protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VRA7_PEDHC
NCBI RefSeq       XP_002428651.1     8e-87    35.58%   ankyrin repeat-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastp    gi|242016049       2e-85    35.58%   ankyrin repeat-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastx    gi|242016049       3e-86    35.27%   ankyrin repeat-containing protein, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0035556   6.3e-06   intracellular signal transduction
                  GO:0005515   3.3e-05   protein binding
KEGG pathway
InterPro domain   [215-559]   IPR020683   6e-88     Ankyrin repeat-containing domain
                  [578-613]   IPR001496   6.3e-06   SOCS protein, C-terminal
Orthology group   MCL20437   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206539-TA
ATGGCAGAGAAGTCACTGGAAGATAGATTATATGACTCAATACTTAAAAATGACCAAGATTCTGTGCTTCAGCTTGTCCTACAAGGTGCCAACCTAAACAAGATAACTAGTCATGGAAAAACATCTCTAGGAGAAGCAGCTAACATTGGAAATTTAGAAATTTTCGAAGTACTAATTGACTATTTTAAATCTACATGCACTCCAAAAAAGTTGTCTCCTAATAGTGCTTCTAAGAAAAGGCATTCAAAGACACAAAAAAGAAAAATGCGTGGAAATTGCCCCAATGAAGATACAGTTATAAAATGTAAAAACACTTGTGATAGAAAAATAAGGTCTAGTCCACTTAATGAATCTGATTCCTTTCAAGATTCTCTTAAGTCTGATAAAAATCAAGGGTACTTTGTATTTATTCATAGCGATGGATCAAGTAGTGATGAAAGTAAACTAAGCTTGAAAACTCCTGTAAGCCCTACATCTGCTGTATCCACACCACAGTTGGACCTGGAATGGGATGAAGAGACAATAAATGTTGCTCCTACGACTAAAGAAGATGAAACATGGTCATCAATGTATCAATGGTATGCTGCAATACTTGAATGTACTGGTGCTGCTATAGCATCAGCTTCTGTGGTGTCTAATGGAATTGACCAACAGGATGCTTTTATGAGAACCGCTTTGCACTATGCAGTTGAACAGGGCCACTCTGAAATTGTAAGGCATCTATTAGATGCTGGTTGTAAAGTAGACCTCGGTCGACCCCTACCGAATTGTAGTTTGCACATAGCCAGTATGAGAAACCATATAGAAATAGTACTGCAATTGTTAGCGGCTGGCGGCAATGTCAACTACAAAACCTATGAGAAGATGACACCATTGCATTTCGCAGTATCAAGGGGTTATTTGAAATTGGTGAAAGTTCTTGTTAGTAATGGAGCTTATTTGGAGGCTCGAGACACAAATGAACGCACAGCCTTATACTTGGCAGCAGGGAGGGGTCATTTGGATGTTGTTAAGTACCTGATATCAGTTGGTGCCAATGTGAACGGTGAAGAAATCAATGGTTATACTCCTCTATGCGAAGCCGTTTGGCAGAGGTATTCGAAAGTTGTTGAGCTTTTGTTAAGTTCCGGTGCTCGCATAACACATTCGCACAAACTTTTGCACAACGCTATTATACAGCGAGAGGAAAAAATCGTAAGAATGTTAGCTAATGTGAGAGGAGGTATTAATTTGCATAATGATAATGGAGACACACCTCTTCTTTTATCAACTCGACTATCCCAACCAGCCGTCGCTAGGATACTACTGCAGAAAGGTGCAAATGTGAACGTCTGTAACAGCATAACAGGGGCGAGTGCGTTACATATCGCCGTAGAAAGCGTCGAGTCTCCCAACAATTTCGAGGAATTGCTGTTATGTTTCCTCGACTATAAAATTGATGTAAACGCCACGGCGTTGACTGGCGACACTGCGCTTAATAGAGCGTTACTTTTGCAAAAAGATCATGCTGCGATTCTATTAATTCGTCATGGCGTGGACGTTAACGCTTGCGATCTTCATTCTTGTGGTTTAGACAATCTAACAATAGCAAGCAAACGTCGATCCAATAAACTCGCTAATATGCTCCTAAAAGCCGGTCATCAAACACATATTTACAGCAAAAATGCTCCAACACCAAAAACAGGAACAACCTCAGATTGGTTGCACCAAGTGTGTAAGCAACCTAATCTGCTGTTAGATTTATGCAGAATTAAAATACGCCAATTGTGTAAAAATAGACCCTTGTATTCTTATGTTAGTTCTTTACCATTACCAAAGAGATTAAAAATATTTCTTATGATGGAAAGTGAAAGTTTAGAAGATACATAG

Protein sequence:

>DPOGS206539-PA
MAEKSLEDRLYDSILKNDQDSVLQLVLQGANLNKITSHGKTSLGEAANIGNLEIFEVLIDYFKSTCTPKKLSPNSASKKRHSKTQKRKMRGNCPNEDTVIKCKNTCDRKIRSSPLNESDSFQDSLKSDKNQGYFVFIHSDGSSSDESKLSLKTPVSPTSAVSTPQLDLEWDEETINVAPTTKEDETWSSMYQWYAAILECTGAAIASASVVSNGIDQQDAFMRTALHYAVEQGHSEIVRHLLDAGCKVDLGRPLPNCSLHIASMRNHIEIVLQLLAAGGNVNYKTYEKMTPLHFAVSRGYLKLVKVLVSNGAYLEARDTNERTALYLAAGRGHLDVVKYLISVGANVNGEEINGYTPLCEAVWQRYSKVVELLLSSGARITHSHKLLHNAIIQREEKIVRMLANVRGGINLHNDNGDTPLLLSTRLSQPAVARILLQKGANVNVCNSITGASALHIAVESVESPNNFEELLLCFLDYKIDVNATALTGDTALNRALLLQKDHAAILLIRHGVDVNACDLHSCGLDNLTIASKRRSNKLANMLLKAGHQTHIYSKNAPTPKTGTTSDWLHQVCKQPNLLLDLCRIKIRQLCKNRPLYSYVSSLPLPKRLKIFLMMESESLEDT-