Monarch geneset OGS2.0

DPOGS212343
Transcript: DPOGS212343-TA, 1395 bp
Protein: DPOGS212343-PA, 464 aa
Genomic position: DPSCF300019 - 285848-290116
RNAseq coverage: 16x (Rank: top 81%)
Annotation
Heliconius: HMEL010188; 5e-100; 72.73%
Bombyx: (none)
Drosophila: psq-PC; 5e-100; 80.48%
EBI UniRef50: UniRef50_E0VAP3; 4e-103; 70.79%; Pipsqueak, putative n=2 Tax=Neoptera RepID=E0VAP3_PEDHC
NCBI RefSeq: XP_002423187.1; 8e-104; 70.79%; pipsqueak, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242004638; 2e-102; 70.79%; pipsqueak, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|270004041; 1e-99; 49.10%; hypothetical protein TcasGA2_TC003349 [Tribolium castaneum]
Group
Gene Ontology: GO:0003677; 7.5e-15; DNA binding
Gene Ontology: GO:0005515; 1.5e-13; protein binding
KEGG pathway: (none)
InterPro domain: [361-405] IPR007889; 7.5e-15; Helix-turn-helix, Psq
InterPro domain: [193-246] IPR009057; 1.5e-13; Homeodomain-like
Orthology group: MCL16642; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212343-TA
ATGCCGGGAACAGACAGATTCAATGTTAAATGCGAGCCGCCTTCAGACGATGAGGATATTAATTCACAATACGTTTCAACATGCTTACTCAGTGAGAGATCGGAATATTATCCAATTAAATTGGAGAACGTCTACTGTGAAGATCGGTCGTCGGATCTGTTACATCAATTGCAGGGTTCGTTGGAGGAGCTGGCGCCGGCGCCGGCGGGGCCTGCCGCCACACCGGTCCGCGCGGCCTCACCAGCACCCTCACGGACCTCGCTGGCCACGCTCGAACCCGCACAGCACCATGAGCTCGCTTCCTTGTTCACGGCAAAAGAAACCCGAGATAGTGACTGCTACCAAACTTCCGTACTGCCCTTCCCCCCTTTTCGGGAGTTAGTTCAACAACAAATGGTTCAGGGGTCGTCACTTATAGACAGGCACATACAAATGATGACAGCTAATGAAATTGCCGACTCAGCAGAATTCGTACCTCTCATGGCTATCAAAGATGAGCCCACTTCGGAAGGAGAACAGCATCATCTCAGTGGTGACGACACCAGCGACTCCAATGGTCCGGCGCCGGAGCCCGTTCGCGGCAGCCCAAAGACTTGGACCCAACAAGATATGGACAAAGCGTTAGAGGCCCTCCGTAAACATAACATGAGCCTGACTAAGGCGTCTGCGACTTACGGGATACCGTCCACAACCTTATGGCAACGAGCTCATCGCCTCGGCATCGACACTCCCAAGAAAGAAGGCAGTTCGAAGTCTTGGAGCGAAGCGGATCTACGGGGAGCCTTACACGCGTTACGCGCCGGCGCTATCTCCGCCAACAAGGCTAGCAAAGCATACGGCATCCCCAGCAGCACGTTGTATAAGATCGCTCGTCGCGAGGGTATCCGGCTGGCCGCTCCGTTCAACGCGGCGCCCACAGCCTGGCGGAGGGACGACCTGGCGCGAGCCCTGGCCGCCATCAGGGCCGGCGCCGCCTCCGTGCAGAGAGCCGCCGCCACCTACGGCATACCCACCGGTACTCTATACGGAAGGTGTAAGAGAGAAGGTATCGAGCTGTCCCGGTCTAACCCGACGCCGTGGTCGGAGGACGCCATGGGCGAGGCGTTAGAAGCTGTCCGAGTGGGTCAGATGTCCATCAACCAGGCAGCCATACATTACAACCTACCGTACTCGTCCCTGTACGGTCGCTTCAAACGATGCAAATATCAGACGCTACAGAGTCAGCCGCCACAGGAGCTACCGAAATATGAAGGTGAGTTCCATCAACAGGGACTGTACTGTCAACACGCGGACACACACCCGCACGTACACAATACACACACACACATACACAACCTGAACGATATAGACACCTACACACACATGTACTACTCGCACTGTAATGTTACCAGCTGA

Protein sequence:

>DPOGS212343-PA
MPGTDRFNVKCEPPSDDEDINSQYVSTCLLSERSEYYPIKLENVYCEDRSSDLLHQLQGSLEELAPAPAGPAATPVRAASPAPSRTSLATLEPAQHHELASLFTAKETRDSDCYQTSVLPFPPFRELVQQQMVQGSSLIDRHIQMMTANEIADSAEFVPLMAIKDEPTSEGEQHHLSGDDTSDSNGPAPEPVRGSPKTWTQQDMDKALEALRKHNMSLTKASATYGIPSTTLWQRAHRLGIDTPKKEGSSKSWSEADLRGALHALRAGAISANKASKAYGIPSSTLYKIARREGIRLAAPFNAAPTAWRRDDLARALAAIRAGAASVQRAAATYGIPTGTLYGRCKREGIELSRSNPTPWSEDAMGEALEAVRVGQMSINQAAIHYNLPYSSLYGRFKRCKYQTLQSQPPQELPKYEGEFHQQGLYCQHADTHPHVHNTHTHIHNLNDIDTYTHMYYSHCNVTS-