研究資料首頁-> 期刊論文
研究資料明細
論文名稱 | Markov chain model based on genetic algorithms for texture and speech recognition |
發表日期 | 2006-12-01 |
論文收錄分類 | SCI |
所有作者 | Shiueng Bien Yang |
作者順序 | 第一作者 |
通訊作者 | 否 |
刊物名稱 | Journal of Electronic Imaging |
發表卷數 | 15 |
是否具有審稿制度 | 是 |
發表期數 | 3 |
期刊或學報出版地國別/地區 | NATTWN-中華民國 |
發表年份 | 2006 |
發表月份 | 12 |
發表形式 | 電子期刊 |
所屬計劃案 | 無 |
可公開文檔 | |
可公開文檔 | |
可公開文檔 | |
附件 | Markov chain model based on genetic algorithms for texture and speech recognition .pdf |
[英文摘要] :
Markov chain models (MCMs) were recently adopted in
many recognition applications. The well-known clustering algorithm,
the k-means algorithm, is used to design the codebooks of the
MCM, and then each code word in the codebook is regarded as one
state of MCM. However, users usually have no idea how to determine
the number of states before the design of the MCM, and therefore
doubt whether the MCM produced by the k-means algorithm is
optimal. We propose a new MCM based on the genetic algorithm for
recognition applications. Genetic algorithms combine the clustering
algorithm and the MCM design. The users do not need to define the
size of the codebook before the design of the MCM. The genetic
algorithm can automatically find the number of states in MCM, and
thereby obtain a near-optimal MCM. Furthermore, we propose the
fuzzy MCM (FMCM) and the fuzzy genetic algorithm (FGA) to enhance
the recognition rate. Experimental results show that the proposed
MCM outperforms the traditional MCM and other texture and
speech recognition methods.
[參考文獻] :
1. P. Kenny, M. Lenning, and P. Mermelstein, “A linear predictive
HMM for vector-valued observations with applications to speech recognition,”
IEEE Trans. Acoust., Speech, Signal Process. 38, 220–225
1990.
2. J. Dai, I. G. MacKenzie, and J. E. M. Tyler, “Stochastic modeling of
temporal information in speech for hidden Markov models,” IEEE
Trans. Acoust., Speech, Signal Process. 2, 102–104 1994.
3. B. H. Juang and L. R. Rabiner, “The segmental K-means algorithm
for estimating parameters of hidden Markov models,” IEEE Trans.
Acoust., Speech, Signal Process. 38, 1639–1641 1990.
4. J. T. Chien, “Quasi-Bayes linear regression for sequential learning of
hidden Markov models,” IEEE Trans. Speech Audio Process. 10,
268–278 2002.
5. J. Dai, “Isolated word recognition using Markov chain models,”
IEEE Trans. Speech Audio Process. 3, 458–463 1995.
6. Y. Linde, A. Buzo, and R. M. Gray, “An algorithm for vector quantizer
design,” IEEE Trans. Commun. 28, 84–95 1980.
7. S. Z. Selim and M. A. Ismail, “K-Means-type algorithm: generalized
convergence theorem and characterization of local optimality,” IEEE
Trans. Pattern Anal. Mach. Intell. 6, 81–87 1984.
8. W. H. Equitz, “A new vector quantization clustering algorithm,”
IEEE Trans. Acoust., Speech, Signal Process. 1568–1575 1989.
9. P. Borodatz, Textures—A Photographic Album for Artists and Designers,
Dover, New York 1966.
10. R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addison-
Wesley, Boston 1992.
11. M. N. Shirazi, H. Noda, and N. Takao, “Texture classification based
on Markov modeling in wavelet feature space,” Image Vis. Comput.
18, 967–973 2000.
12. J. Zhang and T. Tan, “Affine invariant classification and retrieval of
texture images,” Pattern Recogn. 36, 657–664 2003.
13. S. Li, J. T. Kwok, H. Zhu, and Y. Wang, “Texture classification using
the support vector machines,” Pattern Recogn. 36, 2883–2893
2003.