Monarch geneset OGS2.0

DPOGS211428
TranscriptDPOGS211428-TA1218 bp
ProteinDPOGS211428-PA405 aa
Genomic positionDPSCF300115 + 551294-552511
RNAseq coverage108x (Rank: top 60%)
Annotation
HeliconiusHMEL0209956e-14470.97% 
BombyxBGIBMGA004680-TA1e-13263.17% 
Drosophilatap-PA2e-4236.25% 
EBI UniRef50UniRef50_O168673e-4036.25%Basic helix-loop-helix neural transcription factor TAP n=12 Tax=Drosophila RepID=TAP_DROME
NCBI RefSeqXP_002085255.11e-4136.16%GD14703 [Drosophila simulans]
NCBI nr blastpgi|1955910483e-4036.16%GD14703 [Drosophila simulans]
NCBI nr blastxgi|1571037152e-3941.38%target of poxn [Aedes aegypti]
Group
Gene OntologyGO:00056344.2e-20nucleus
GO:00063554.2e-20regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[134-190] IPR0115984.2e-20Helix-loop-helix DNA-binding
[139-191] IPR0010923.4e-15Helix-loop-helix DNA-binding domain
Orthology groupMCL18136 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211428-TA
ATGTACGCGACGGACTTCGACGATTACTGCGACTTCAACGACAGCGTCTGCAGCAATGATTCGGGCTTCGAGAGGTCTCATGCTGACTCCCACGCAACACCCTCATCTGTGAACACTACTCTCAATTTTGATACATCAGACTTAGCTATATCTGTGATATCAAAAGATTCAACTGTCAGGAGAAAACTCTTTCATGACGACTACACCTACATCTTCCCAGATCAGATAAAGGAGGTCACGGAATTCGAAGATTTTAAGCCGATAGATAACACGTCGACACCGATAAAGAAAGAGAAAAAACCTAAAGATCCGAACAAACCCAAGAGGAAATACGCAAACGGAAAGAACAGAGTGACGAGATGTAAAAGTCCCACTCAGATATTGAGAATAAAGAGGAATAGAAGGATGAAGGCCAACGACAGGGAGAGAAACAGAATGCACATGTTGAACGAAGCCTTGGACAGGTTGAGATGTGTCTTACCAACGTTTCCAGAAGACACGAAGCTGACCAAAATAGAAACATTACGTTTTGCACACAATTATATATTTGCTTTAAGTCAGACGCTTGAATCTCTAGATAATATTAATTGTGGACAACCCAGCGCCGAATATAATAATTACGACAAACTAACTTGCACTGCCGAGAAAGTTAACAAAGACGCCTTCAGGGAGATATTTCTTCCAAACAAGGACGAGTGCGGTGATGGATACAGGAGTTTCCAAGGGTATAGTAAGCCGTTCCCTAACGGATCGAATTTCTTACAAACTTCCGAAGGAGTGTTGATAAATGTTGGAAATGTAACGGTGTCTGTGAATAATAAAGGCGGTAACTGTATCACTTCGACGACCGGGAGCGGCTTCTTCTCTCATCCCTCGAGTCTAGCGGACGACATACATCAAGGTTACCCTCAAAGGCCTTACGATATCACCAGCTACACGGAAAGGTATGATCCCAGGATGCAAAATGCATCGACGGAATACTTTAATCACAAAAATTATGAAATATTCAAAAACGCTTTTGAAACAGCGAAAAATAGGAAGCAAGTCAGCGCCGTCCAGTACAATAATTACACGAATTTTACAAATTCCTATCACTGCAACGAGAAATATAGTTACACAGACGAGAGTTGTTATCCTCAAAGTAACTATTATAATGATCAAAGATACGTAGGTAGAGATTTTTATAGAGGTCCTAGCATGGTCAATGCACAAATTTAA

Protein sequence:

>DPOGS211428-PA
MYATDFDDYCDFNDSVCSNDSGFERSHADSHATPSSVNTTLNFDTSDLAISVISKDSTVRRKLFHDDYTYIFPDQIKEVTEFEDFKPIDNTSTPIKKEKKPKDPNKPKRKYANGKNRVTRCKSPTQILRIKRNRRMKANDRERNRMHMLNEALDRLRCVLPTFPEDTKLTKIETLRFAHNYIFALSQTLESLDNINCGQPSAEYNNYDKLTCTAEKVNKDAFREIFLPNKDECGDGYRSFQGYSKPFPNGSNFLQTSEGVLINVGNVTVSVNNKGGNCITSTTGSGFFSHPSSLADDIHQGYPQRPYDITSYTERYDPRMQNASTEYFNHKNYEIFKNAFETAKNRKQVSAVQYNNYTNFTNSYHCNEKYSYTDESCYPQSNYYNDQRYVGRDFYRGPSMVNAQI-