Monarch geneset OGS2.0

DPOGS214466
TranscriptDPOGS214466-TA639 bp
ProteinDPOGS214466-PA212 aa
Genomic positionDPSCF300122 - 643868-648436
RNAseq coverage225x (Rank: top 44%)
Annotation
HeliconiusHMEL0037451e-0779.41% 
BombyxBGIBMGA013352-TA2e-3368.05% 
Drosophila% 
EBI UniRef50UniRef50_D6W8F27e-1642.54%Nuclear factor 1 n=2 Tax=Tribolium castaneum RepID=D6W8F2_TRICA
NCBI RefSeqXP_971603.11e-1642.54%PREDICTED: similar to Nuclear factor I CG2380-PB [Tribolium castaneum]
NCBI nr blastpgi|2700144962e-1542.54%hypothetical protein TcasGA2_TC001775 [Tribolium castaneum]
NCBI nr blastxgi|2700144962e-2540.81%hypothetical protein TcasGA2_TC001775 [Tribolium castaneum]
Group
Gene OntologyGO:00056341.7e-06nucleus
GO:00063551.7e-06regulation of transcription, DNA-dependent
GO:00062601.7e-06DNA replication
GO:00037001.7e-06sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[46-132] IPR0006471.7e-06CTF transcription factor/nuclear factor 1
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214466-TA
ATCCCTCACAGCTCGGCGGGCCTGAGCGGCCAGTCGCCTCTCGTTCCTTCCAACACCATCTTCTACCAGCACGCCCCGGCCGCCGACACTCACTCCACTCCGTCGGAGTCTCTGAACCAGCACGGGAAATACGAGGGCGGTCAGGACTCGCTCGGGGACTTCGTGACCTTCGTGTGTCAGGAGCCGCCCGCGGACGTGCAGCAGCTACAGGTGCACAGCCTGTCTCGTAGTCCCAAACCGCCGTACTTCAGCAGCTCGATGCTGCCCCCGCCCCCGCTCCCGCCCATGGCCAGGCCGGTCACCATCATCAGATCCACGGCGAGCGAGGCGGGCGGGGGCGGCGGATCCCCCTCGTCCCCGGAGCTGCGCTCGCCGCCCGCGTCTCCGCGCCGCCGCTCCCCTCACTACCGGGACTGCGCCCCCATCAGCCACTTCAACCACTTCCACCAGCCGCAGCAGGTCTGTGTTCAGTTACGGTGCTGTGGGAGGGAGCTCTCCCCCGGGGGGACTCGGCCTGTACGCGCCGCGGGCAGCTCCTCGCGCTCCGCCACGATGGAACGCGCCCTTCCCCACGCTGGAGGACGAGTTCAACATCATGACGGCGCCCGCCGGACCCGACCACGTCGTGTTGCTGGATGA

Protein sequence:

>DPOGS214466-PA
IPHSSAGLSGQSPLVPSNTIFYQHAPAADTHSTPSESLNQHGKYEGGQDSLGDFVTFVCQEPPADVQQLQVHSLSRSPKPPYFSSSMLPPPPLPPMARPVTIIRSTASEAGGGGGSPSSPELRSPPASPRRRSPHYRDCAPISHFNHFHQPQQVCVQLRCCGRELSPGGTRPVRAAGSSSRSATMERALPHAGGRVQHHDGARRTRPRRVAG-