Monarch geneset OGS2.0

DPOGS201858
Transcript | DPOGS201858-TA | 1713 bp
Protein | DPOGS201858-PA | 570 aa
Genomic position | DPSCF300191 - 93432-97396
RNAseq coverage | 246x (Rank: top 42%)
Annotation
Heliconius | HMEL006987 | 2e-171 | 70.82%
Bombyx | BGIBMGA006106-TA | 8e-163 | 74.02%
Drosophila | CG17068-PA | 2e-71 | 55.00%
EBI UniRef50 | UniRef50_D6WNZ2 | 2e-150 | 50.26% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WNZ2_TRICA
NCBI RefSeq | XP_974719.2 | 4e-151 | 50.26% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp | gi|189237846 | 9e-150 | 50.26% | PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx | gi|170028735 | 5e-147 | 47.66% | conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene Ontology | GO:0005515 | 1.9e-22 | protein binding
KEGG pathway
InterPro domain | [1-130] | IPR011333 | 6.3e-27 | BTB/POZ fold
InterPro domain | [22-126] | IPR013069 | 1.9e-22 | BTB/POZ
InterPro domain | [29-130] | IPR000210 | 2.1e-22 | BTB/POZ-like
InterPro domain | [136-210] | IPR011705 | 1.8e-11 | BTB/Kelch-associated
Orthology group | MCL16461 | Insect specific

Nucleotide sequence:

>DPOGS201858-TA
ATGGCTTCAGCACCGACAGATTGGCAGCTTGAATGCACTGAATTAAAGCAACGGGGGGCCTACCTTCTTCAAACAGGCCAGTGGTCTGATTGCACCTTCTTAGTAGGAACGGAACCCAACCAAGTAGTGGTGGTCGGTCACAAGCTCATCCTAGCGATGGCGTCTCCGGTGTTCGAAGCAATGTTTTATGGTGGGATGGCTGAACGAAATGAACCAATACCAATCGTAGACGTCCAAATAGATGCCTTTAAAGCTCTTCTAGAATACATTTACACAGGCAATATAAACATTAGTTCATTCGATAAGGCCTGCGAATTATGTTACGGCGCCAAGAAGTATATGCTGCCACATCTGGTTAAAGAATGCACCAGATATTTATGGTCCGATCTGTACCCTAGAAATGCCTGTAGGGCTTATGAATTTGCAAGACTGTTCGAAGAGAATGTGCTCATGGAGAAATGTATTCAGATTATTAGCACCAATACTAAAGAGGTCCTAAATGACAGCAGTTTTGAGGAGGTGGAGCTTAACACGGTCATTACTGTATTCTCTTTAGACCATCTCAATGTTGACAGTGAACTAGACCTGTTTGAAGCTGCTGTGAGATATGCAAAAGCAATGGAAAAAAGGAATGTTGAAAACAACAGCCCCCCAGCTGATGGAGCATCTGCAAGTGAAAATCGTCCAAAGAGTCCCGTGCCTTCAACATCCCAGGAGAAAGAACAGGTGGTGAATATTGATAGTAGTGCTGAAAGTAGCCCAGAGGCAGTCATGGACACTTGCAAATCAAATGAAGCCGAGACAGCGACTCCAACGCAAACCAAACATTTAGATAAGTTGGGAGAGAAGAAAGAAAAGCCATCAATTCGTCGTGCTGTGGAGAAGATTCGTTTCTTGACTCTATCGCCTCAGCAGTTTGCGGCAGGTCCGGCACGTTCGGTGCTGCTCAGTGAGAGTGAAGCCTTTGCTGTCCTCATGAACATCCTCAATGCTCACACTGATGTAGCGTTGCCTGAAGGCTTCTCCACTTCCAGGGTGCCAAGGAAGCAACTGATCAGTTCCAATTGTAATATGCCTACGTTCACCGTGGACACTCCGAGTCCAGTGAGTGTGGAGCAAGTGTGGGGCGAGCTGCATCCCAGACCATCCCGACACGATGGAGTAAAGTTACCCCTTTCAATCCACAATCCCGTCTCCGGTATGGTCGTGTTAGAGCACATGGAGCGTCACAGCGACGGACACAAGATGTACTGTCAGAGGGCGCTGGTGCAGCACACGGACTGTCTTAACACCAACTCGCTGGACTGCTCCGTCACGTTCATGGTAGACAAGAACATTTGTCTTCTGGGAGTTCAGGTGCCGACCCAGGCTCCCAGCGAGGAGTCAGGTGGATATGTCTGTGGTGGTGGTGGTTACTCTGAACTCCTGTACGCACACCTGCTGGATTCTGATGGGGCGAGGCTCACGTACACACACTACACACATCGGGTACCATACCGACATCTGCAGGACATCATGTTCAACAGACCTGTCTACATACAGAAGAATAAGGTCTACAAGGTAGCAATAGTTTTTAACAAGATGGGCTGGTACCCGATTGGTCTGTGCGCACAGCAGGTCACCGCTGAGGGGGTGTCCTTCAACTTCGGCATCGGACACAACTGTGATTCCATCAGGGATGGACTCATCAGATCTATCATATTCACATACTAG

Protein sequence:

>DPOGS201858-PA
MASAPTDWQLECTELKQRGAYLLQTGQWSDCTFLVGTEPNQVVVVGHKLILAMASPVFEAMFYGGMAERNEPIPIVDVQIDAFKALLEYIYTGNINISSFDKACELCYGAKKYMLPHLVKECTRYLWSDLYPRNACRAYEFARLFEENVLMEKCIQIISTNTKEVLNDSSFEEVELNTVITVFSLDHLNVDSELDLFEAAVRYAKAMEKRNVENNSPPADGASASENRPKSPVPSTSQEKEQVVNIDSSAESSPEAVMDTCKSNEAETATPTQTKHLDKLGEKKEKPSIRRAVEKIRFLTLSPQQFAAGPARSVLLSESEAFAVLMNILNAHTDVALPEGFSTSRVPRKQLISSNCNMPTFTVDTPSPVSVEQVWGELHPRPSRHDGVKLPLSIHNPVSGMVVLEHMERHSDGHKMYCQRALVQHTDCLNTNSLDCSVTFMVDKNICLLGVQVPTQAPSEESGGYVCGGGGYSELLYAHLLDSDGARLTYTHYTHRVPYRHLQDIMFNRPVYIQKNKVYKVAIVFNKMGWYPIGLCAQQVTAEGVSFNFGIGHNCDSIRDGLIRSIIFTY-