To improve prediction accuracy of glycosylation site.A new method is proposed based on principal component analysis(PCA) and independent component analysis(ICA) for prediction O-linked glycosylation site and pattern analysis.Sparse coding scheme of protein sequence is applied when the window size is 51 in this research.PCA is firstly used to reduce dimension and second order correlation.Then ICA is used to extract independent components to construct a subspace(main basis) of protein sequence by training.The test protein sequence is projec...