Monarch geneset OGS2.0

DPOGS204773
TranscriptDPOGS204773-TA2532 bp
ProteinDPOGS204773-PA843 aa
Genomic positionDPSCF300231 + 233759-335638
RNAseq coverage19x (Rank: top 80%)
Annotation
HeliconiusHMEL0029369e-13853.30% 
BombyxBGIBMGA013723-TA5e-10266.20% 
DrosophilaCG4050-PA2e-7132.25% 
EBI UniRef50UniRef50_UPI00021A88EF0.048.21%UPI00021A88EF related cluster n=6 Tax=unknown RepID=UPI00021A88EF
NCBI RefSeqXP_973900.26e-16143.11%PREDICTED: similar to smile protein [Tribolium castaneum]
NCBI nr blastpgi|2700015510.048.50%hypothetical protein TcasGA2_TC000396 [Tribolium castaneum]
NCBI nr blastxgi|2700015510.049.58%hypothetical protein TcasGA2_TC000396 [Tribolium castaneum]
Group
Gene OntologyGO:00054881.3e-25binding
GO:00055158.4e-05protein binding
KEGG pathway 
InterPro domain[263-338] IPR0136184.5e-28Domain of unknown function DUF1736
[424-642] IPR0119901.3e-25Tetratricopeptide-like helical
Orthology groupMCL15715 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204773-TA
ATGAAACGAAGACTCCCAATTGTGAAAGAAGAGAAGAGAAGCCCTAAGAATATTTGTAACAGTGATCTTGCTATTTATACGATGGTGATATCAACGGCGGTTTTAAGTTATGTAAACAGTTTGAACGGTGACTTCGTTCACGACGATATACCGGCTATAGTGACGAACGGTGACGTGGTAGGGAGAGGCAGTATTAGAGAGTTGTTTTTGAACGACTTCTGGGGCACGGCGATGGTTGATCCCAATAGCCATAAATCATATCGCCCCTTGACGACGTTATCATTTAGGATAAATTACGCATTAACTGGTTTAAAGCCATGGTGGTGGCACGCTTGCAATGTTCTGTTGCACGCAGCGTGTTGCGCGTTAGTCGCGCGGGCATGCGTGACGATCGCGCGTTTACAAAGACCGTTCGCCGCATTAGCGGCATTGCTGTTTGCGGTACATCCTGTTCATACCGAAGCGGTAGCCGGTGTGGTAGGCAGGGCGGATGTACTCGCTTGCATATTTTTCCTATCCTCGCTTCTAGTTTATCATAGACCGACGAGTAACAAGAAATGCGTATGGTTGAGTATTATTTTGGGAGCTCTGAGTATGCTGGCCAAGGAGACTGGCGTCACAATCCTGTTGCTCAACCTGGCTATTGACTTCTACAGATGTTGGCCATTCGTAAAAAGATCATTATGTACGTTGAAGTTTGAAAAAAAATGTTCTGGTCTTTCAATAAGGACGACAAAAGTGCTTGTATCACTGGCTCTATTGGTTTCTCTTCGCCTGGCGTTGCTACAAGGGACATGGCCAACATTCTCGCCCCAGGACAATCCGGCTTCTTTCCACCCGAGCTTTTTCGTGAGGTTGATGACTTTCTGCTACTTGGCGGCTTTCAATTGGTGGTTACTACTGTGTCCGTGGTCTCTAAGTCACGATTGGCAGATGGGTTCTGTTCCACTCATCGCTAGTGGATGGGATCCCAGAAACTTGTTGACATGTGCTGCTTTTGGTGCTTTGCTGGTCCTTTGCTATAGATGTGTTGCAGATTTAGACGTACAAAAACATACGCCTGCTGTTATTGGCTTACTGTTGCTGGTGGTACCATTCGTGCCGGCCAGCAACCTACTCGTCACCGTTGGATTTGTCATTGCTGAAAGAATTCTTTACATACCTAGCGTTGGAAGCGTCATTATAACAGCCTACGGTGTGCAACTTATGTGGTATTCAAAGCCAGGGACCAAGATATGTTTGATTGTGGGACTTGCGGTGCTCGCTGCTAGCGGTGTGGCAAGAACGTATAAAAGGAATGCCGACTGGAAAGATCGAGCAACATTACTTAGAGCGGATTTAGTGACTTTGCCGCAGAATGCCAAACTGCACTACAACTTTGGCAACTTCCTTCGAGAAACAGAGCAGCAAGACAACGCTATCAAACATTACAAAGAAGCTTTGAGGTTATGGCCGACGTACGCGAGTGCTCACAATAATATTGGCACTCTCAGTAACGCGGAGAACGCAGAACAGCACTTCTTATCAGCGATCGCACACAACAGATACCACGTGAATGCGCACTACAATCTGGCGAAACTTTATAAGAAAGGTGGCCGAATAAACCAGGCTGTGAGAATACTCGAGCGCTGTGTGGTACTCCAGCCACGTTTTGTTCAAGCCTACATTGAATTGCTGTCCTTGAAGCCGGAACCTGAAAAGGCGAGAATATTGGCACGAGTAGTTGAATTGGAGCCCAATAATTGGGAGCATTACATTTTATATGGAAACTGGTATAGGAATAAAGGATTACCGGGAGCCGCCGCAAAATATTTCGTGGAAGCCACAAGACTTAGTTTCAGAAATAGGAATGTTGAAAAAGCAATGAGGGGTGATTTGATCTCACTCCGATCAACGGCGCTTGTATACAGGAGTCTGGGACAAAAATCGAGGGTTCTTCAACTTTTAACCAGATGGCACACTTGGCGTCGTGGTTGGCCGAGTACTGCTGCTGCCCACATGTACTTACAGGAGTGGCGTCTGAAGATGGAGCTAGAAGGGCGAGTTCAGATTTATTCGAAAGCTGTCAATCCAACTAAATCGAAAACCTGTTTCGACCACTCACAACTAGCAGTTGGATCTCCCGCAGCTAATGAAGAAAAAATTTCAAAATACGAGGAAAGAATTAAAGAAGAACAAGTTATAGATAATACAGAACTTGCTTTTGATACAAAATTAACAGTTGATAGAACGTGTGATAGAAAAGAATGTGAGGGTAAACGTGATCTCTCAAAATCGAAAACCTGTTTCGACCACTCACAACTAGCAGTTGGATCTCCCGCAGCTAATGAAGAAAAAATTTCAAAATACGAGGAAAGAATTAAAGAAGAACAAGTTATAGATAATACAGAACTTGCTTTTGATACAAAATTAACAGTTGATAGAACGTGTGATAGAAAAGAATGTGAGGAGCAAGACTATAACGCAGAAGAAAACCGCGACCACCACCACCACGACCACGATGTAGCTACCCCTCCTTTTTTAGCGGCTTAG

Protein sequence:

>DPOGS204773-PA
MKRRLPIVKEEKRSPKNICNSDLAIYTMVISTAVLSYVNSLNGDFVHDDIPAIVTNGDVVGRGSIRELFLNDFWGTAMVDPNSHKSYRPLTTLSFRINYALTGLKPWWWHACNVLLHAACCALVARACVTIARLQRPFAALAALLFAVHPVHTEAVAGVVGRADVLACIFFLSSLLVYHRPTSNKKCVWLSIILGALSMLAKETGVTILLLNLAIDFYRCWPFVKRSLCTLKFEKKCSGLSIRTTKVLVSLALLVSLRLALLQGTWPTFSPQDNPASFHPSFFVRLMTFCYLAAFNWWLLLCPWSLSHDWQMGSVPLIASGWDPRNLLTCAAFGALLVLCYRCVADLDVQKHTPAVIGLLLLVVPFVPASNLLVTVGFVIAERILYIPSVGSVIITAYGVQLMWYSKPGTKICLIVGLAVLAASGVARTYKRNADWKDRATLLRADLVTLPQNAKLHYNFGNFLRETEQQDNAIKHYKEALRLWPTYASAHNNIGTLSNAENAEQHFLSAIAHNRYHVNAHYNLAKLYKKGGRINQAVRILERCVVLQPRFVQAYIELLSLKPEPEKARILARVVELEPNNWEHYILYGNWYRNKGLPGAAAKYFVEATRLSFRNRNVEKAMRGDLISLRSTALVYRSLGQKSRVLQLLTRWHTWRRGWPSTAAAHMYLQEWRLKMELEGRVQIYSKAVNPTKSKTCFDHSQLAVGSPAANEEKISKYEERIKEEQVIDNTELAFDTKLTVDRTCDRKECEGKRDLSKSKTCFDHSQLAVGSPAANEEKISKYEERIKEEQVIDNTELAFDTKLTVDRTCDRKECEEQDYNAEENRDHHHHDHDVATPPFLAA-