添加收藏
 系统管理
 联系方式
您当前位置: 首页 > 机构设置 > 院部设置 > 计算机与通信工程学院 > 研究生培养 > 导师名册

殷苌茗教授 

     2008-04-10

=== 殷苌茗 ===

 

    殷苌茗 ,男,湖南安乡人,1964年5月生,中共党员、博士、教授,硕士生导师。湖南省青年骨干教师。1986年7月本科毕业于北京师范大学数学专业,1998年6月硕士研究生毕业于国防科技大学自动控制专业,上海大学运筹学与控制理论博士研究生。长期从事计算机及应用研发,发表论文近40篇。目前研究方向为计算机软件、人工智能、机器学习、最优控制与决策等。

基本情况简表

姓名:殷苌茗
出生年月:1964.5
民族:汉
政治面貌:中共党员 学历:博士
现任专业技术职务:教授(2007年10月)
通讯地址:长沙市赤岭路45号,长沙理工大学计算机与通信工程学院
电话:(0731)5040667(O),2618828(H),013507341788(M)
email: yinchm@csust.edu.cn 邮政编码:410076



1982年9月~1986年7月 北京师范大学数学系读本科获学士学位
1996年9月~1998年7月 国防科技大学自动控制系读研究生 
1994年2月~1994年12月 国防科技大学六系(计算机科学系)进修
2003年3月~2006年12月上海大学运筹学与控制论博士研究生




1982.7-1986.7  北京师范大学数学专业学习 ,1994.2-1994.7  国防科技大学计算机专业进修,1996.8-1998.7  国防科技大学自动化专业在职研究生,2003.3-2006.12,上海大学博士研究生。自1986年7月起一直在长沙水利电力师范学院、长沙电力学院、长沙理工大学从事教学、科研与管理工作。主讲专业课和专业基础课达10多门,教学效果好。是长沙电力学院计算机专业的主要创始人之一。多年来负责计算机专业的建设和管理工作,曾任计算机专业教研室主任多年。2002年4月至2003年7月,任长沙电力学院数学与计算机系总支副书记,2003年7月至2004年8月,任长沙理工大学物理与电子科学系总支副书记。2005年11月至2006年9月在瑞典Lund大学自动控制系做访问学者。







[1] Reinforcement Learning Algorithm for Solving RTDP with Variational Environment. ICGST International Journal on Artificial Intelligence and Machine Learning (AIML), Volume (7), Issue (I), pp17-21.

[2] Reinforcement Learning Algorithms Based on mGA and EA with Policy Iterations. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Bio-Inspired Computational Intelligence and Applications - International Conference on Life System Modeling and Simulation, LSMS 2007, Proceedings v 4688 LNCS 2007.

[3] Risk-Sensitive Reinforcement Learning Algorithms with Generalized Average Criterion. Applied Mathematics and Mechanics-English Edition, 2007, V28, N3 ( MAR ) , pp405-416.

[4] Global Attractor for KGS Lattice System. Applied Mathematics and Mechanics-English Edition, 2007, V28, N5 (MAC), pp619-628.

[5] Fused SarsalambdaLearning Algorithm Based-on Multi-agent. Journal of Computer Engineering and Applications, 2008, 44 (4), pp182-183.

[6] Automatic Discovery of Subgoals for Sequential Decision Problems Using Potential Fields. 2005 International Conference on Natural Computation/2005 International Conference on Fuzzy Systems and knowledge Discovery (ICNC'05-FSKD'05), IEEE. 27-29 August 2005, Changsha, China. (Lecture Notes in Computer Science, v 3612, n PART III, Advances in Natural Computation: First International Conference, ICNC 2005. Proceedings, 2005, pp384-391)

[7] Optimal Equality for Multi-Time Scale Risk-Sensitive Markov Decision Processes. Proceedings in the International Symposium on Computer Science and Technology 2005, Ningbo, China.

[8] Reinforcement Learning Algorithm Based-on Policy Iteration for Solving RTDP. 2006.8, ISAI’2006, Beijing, China.

[9] U-Clustering: A Reinforcement Learning Algorithm Based on Utility Clustering. Journal of Computer Engineering and Applications, 2005, No.20.

[10] Reinforcement Learning Forgetting Algorithm Based on Dynamic Programming. Journal of Computer Engineering and Applications, 2004, No.20.

[11] The Dynamic Merge Reinforcement Learning Algorithm for Solving POMDP. Journal of Computer Engineering. 2005, 11.

[12] Multi-Time Scale Risk-Sensitive Hierarchical Structure Control Problem. DCABES2006, Hangzhou, China, 2006.10. 

[13] Utility Clustering for Reinforcement Learning with Partial Observability. In Proceedings of Conference of Chinese Intelligence Automatization, HongKong, China, 2003.(IJCAI03).

[14] Average Asymptotic Temporal Difference Learning Forgetting Algorithm on Eligibility Trace, Journal of Changsha University of Electric Power, 2003 (4).

[15] Nonlinear Control Based on Q-learning Algorithms. Journal of Changsha University of Electric Power, Val.18, No.1, 2003 (1).

[16] A Relative Value Iteration Q-Learning Algorithm and Its Convergence Based-on Finite Samples. Journal of Computer Research and Development. Sept.2002, Vol.39, No.9.

[17] Optimality Cost Relative Value Iteration Q-Learning Algorithm Based on Finite Samples. Journal of Computer Engineering and Applications, 2002, No.14.

[18] Generalize Average Algorithm for Reinforcement Learning Its Convergence. Journal of Computer Engineering and Applications, 2002, No.20.

[19] Reinforcement Learning Algorithm Based on average Cost Optimization for Each Stage. Journal of Computer Applications, Val.22, No.4, 2002 (4).

[20] Classification for Un-labeled Context Based on Maximum Expectation Learning Algorithm. Proceedings of 14th CDC (Annul Conference of Control and Decision, China).

[21] ATD(lambda) Learning Forgetting Algorithm. Proceedings of 4th Machine and Electric Engineering Association of Hunan, China, Aug. 2002.

[22] Distributed Real-time System for Electric Power Enterprise Based on Intranet/Web. Journal of Applications of the Computer Systems, 2002(4).

[23] The Uniform of Security Policy in Distributed System. Journal of Information Engineering University, 2001. (Proceedings of Annual Conference of Chinese Networks and Information Security, Zhengzhou,China, 2001).

[24] Design of Distributed Real Time Database System Based on JDBC/Web. Journal of Computer Development and Applications. 2001,No.36.

[25] The Application Delphi Multi-thread for Distributed Real time Multi-task System. Journal of Changsha University of Electric Power, Val.15, No.1, 2001 (1).

[26] Comparing ARP of IPv4 with Neighbor Discovery Protocol of IPv6. Journal of Changsha University of Electric Power, Val.16, No.1, 2001 (1).

[27] Study and Application of Distributed Real Time Multimedia Database. Journal of Changsha University of Electric Power, Val.16, No.2, 2001 (2).

[28] The Design of Real-time Monitor Database System Based on Distributed Heterogeneous Networks Environment. Journal of Changsha University of Electric Power, Val.16, No.3, 2001 (3).

[29] Distributed Real-time Multi-task System Study and Application for Monitoring and Supervising in Electric Power Plant. Proceedings of 1st Machine and Electric Engineering Association of Hunan, China, Aug, 1999.

[30] The Principles and Design Methods for Domain Service System of Campus Networks. Journal of Changsha University of Electric Power, Val.13, No.1, 1998(1).

[31] Security Study for Windows NT Network Management. Journal of Changsha University of Electric Power, Val.13, No.2, 1998(2).

[32] The Weighed Lorentz Norm Inequality of Generalization Maximum Operator. Annual of Hunan Mathematics, Val 17, No.2, 1997.

[33] The Weighted boundary of Operator and its interpolation on Mixed Lebesgue Space. Journal of Changsha University of Electric Power, Val.12, No.3, 1997 (3).

[34] The Alternativeness of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.8, No.2, 1993 (2).

[35] The Combiner Theory of Non-Commutative and Non-Combinative Fractional Ring. Journal of Changsha University of Water Resources and Electric Power, Val.6, No.2, 1991 (2).

[36] The Equivalence Conditions for Reductionable Elements on Complex Commutative Banach Algebra. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.1, 1990 (1).

[37] F-Set on Unit square-cube under n-Dimension Euclid Space. Journal of Changsha University of Water Resources and Electric Power, Val.5, No.2, 1990 (2).

 





1、火力发电厂分布式数据采集与故障诊断系统,湖南省电力局科研项目(1998年),已结题 ,6万元,主持。
2
、智能体在部分可观测马尔可夫环境下的激励学习研究,国家自然科学基金项目,在 研 ,20万元,主研。
3
、江西省地区电网负荷预测与分析系统,江西省电力总公司, 已结题,50万元,主研。
4
、教学管理软件的开发与推广,长沙电力学院教研项目 (2000年),已结题,0.5万元,主研。
5
、激励学习算法的收敛性研究,湖南省教委科研项目 (2000年),已结题,0.5万元,主研。

6、激励学习智能体最优控制策略及其在微经济环境下的决策问题,湖南省教育厅科研基金项目(2007),在研,1万元,主持。

7、多时间参数风险敏感度MDP研究,长沙理工大学科研基金项目(2006),在研,3万元,主持。




1.     1998年度 获长沙电力学院优秀教师

2.     1998年度 获系优秀毕业实习指导教师

3.     2000年度 获长沙电力学院优秀教师

4.     2000年度 获长沙电力学院优质课奖

5.     2001年度 获长沙电力学院优秀教师

6.     2002年度 获长沙电力学院优秀教师

7.     2002年度 获华中电力集团奖教基金奖三等奖

8.     2003年度 湖南省高等学校青年骨干教师培养对象




曾主讲的主要课程有:高等数学、拓扑学、微分几何、计算方法、离散数学、PASCAL语言程序设计、数据结构、CAD技术、算法设计与分析、计算机网络、JAVA与面向对象程序设计、面向对象技术与可视化编程(Delphi)、软件工程、高级语言程序设计(C语言)、Web数据库及应用、计算机英语、汇编语言程序设计等。




计算机软件、人工智能、机器学习(特别是激励学习方面)
Markov决策过程,部分可观测Markov决策过程
最优控制与决策
计算机网络的最优化问题等等

.::文档附件::.
编 辑:计算机与通信工程学院
供 稿:
:: 长沙理工大学计算机与通信工程学院 ::
地址:长沙理工大学云塘校区理科楼B-108 电话:0731-85258462
Copyright (c) 2009-2010 维护:陈曦小(老师) 许苗华 兰宇识