AI that can handle different tasks without human tuning is closer to the way humans think

Source: Science and Technology Daily Published: 2019-12-05 08:43

Gary Marcus, a professor of psychology and cognitive science at New York University, has recently been at odds with DeepMind, the artificial intelligence company. After questioning on Twitter the Rubik's Cube robotic hand built by OpenAI, the US research organization working on general artificial intelligence, he has now raised six major questions about the newly evolved version of AlphaStar, the StarCraft II agent launched by DeepMind. This time his doubts concern not the agent's performance in the game itself but something at a higher level: its significance for future research on general intelligence.

The coolest results in recent years have come from deep reinforcement learning

The Rubik's Cube robot launched by OpenAI does not use a specialized algorithm to solve one specific task (an approach that requires reprogramming whenever the task changes). Instead, it is trained with a learning method intended to give the robot human-like problem-solving ability. Marcus, however, considers this description of the result misleading. In his view a more fitting description would be "manipulating a Rubik's Cube with reinforcement learning" or "progress in manipulating objects with dexterous robotic hands."

"Marcus puts too much emphasis on 'manipulating a Rubik's Cube with reinforcement learning'. In fact, both OpenAI's Rubik's Cube robot and the evolved version of AlphaStar, the StarCraft II agent released by DeepMind, use deep reinforcement learning. Deep reinforcement learning is currently recognized as the existing technology most likely to achieve general artificial intelligence," said Hao Jianye, associate professor at the School of Software, School of Intelligence and Computing, Tianjin University. He explained that machine learning currently has three major branches: supervised learning, unsupervised learning, and reinforcement learning. Deep learning is currently the most mainstream technique within supervised learning. Deep reinforcement learning fuses the two by integrating deep neural networks into the reinforcement learning framework.

"In recent years, deep reinforcement learning has developed rapidly and has shown great potential in handling complex, multifaceted decision-making problems. At present, the technology is mainly used in games and competitions," Hao Jianye said. Starting in 2016, Google's AlphaGo defeated the world's top Go players Lee Sedol and Ke Jie, a milestone in the field of artificial intelligence. The core of AlphaGo is a deep reinforcement learning algorithm that lets the computer keep improving its playing strength through self-play. Since then, OpenAI's agents have defeated top professional players in Dota 2, and Libratus, the Texas Hold'em AI developed by a CMU team, has easily defeated top human players.

In addition, DeepMind uses deep reinforcement learning to optimize the energy consumption of data centers. Google uses it to automate the architecture search for deep neural networks and has launched the AutoML service, which brings machine learning as a service to a mass audience. Deep reinforcement learning also has many applications in China: teams at Alibaba, Tencent and Baidu apply it to decision-making in practical problems such as search, recommendation, marketing, dispatching and path planning.

The technology most likely to achieve general artificial intelligence

Much of the technical credit for artificial intelligence reaching its current level belongs to deep learning algorithms. Deep learning uses multi-layer neural networks to learn from massive amounts of data and make predictions, which makes AI systems increasingly capable. Security monitoring, autonomous driving, speech recognition, Baidu Maps and other services we use today are all applications of deep learning in image vision, speech recognition, natural language understanding and related fields.

Reinforcement learning is another hot technology in machine learning today. Unlike supervised learning, which trains models on known labels, reinforcement learning can learn autonomously, much as a human does, without being given explicit instructions. Once it has learned enough, a reinforcement learning system can predict the correct result. "The basic idea of reinforcement learning is to learn which behavior maximizes the expected return in different environments and different states," Hao Jianye said. The new version of the AlphaStar agent uses the self-play technique of reinforcement learning, and its learning process requires no data annotation; it is driven by a reward function. When the agent scores or wins a game it receives positive feedback, and it adjusts its behavior according to how the match went. This is like a baby learning to walk, adjusting its movements according to the results.
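The reward-driven, trial-and-error loop described above can be sketched with tabular Q-learning on a toy problem. This is only an illustration and not AlphaStar's method: the five-cell corridor, the `step` function and all parameter values are assumptions made up for the example.

```python
import random

N_STATES = 5          # hypothetical corridor with cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # step left or step right


def step(state, action):
    """Toy environment: move along the corridor; reward 1.0 only at the goal."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values purely from rewards; no labeled data is used."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]   # q[state][action index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore at random with probability epsilon (or when the two
            # estimates are tied); otherwise take the action currently rated best.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Q-learning update: move the estimate toward
            # reward + discounted value of the best next action.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best-rated action for every non-goal cell."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

The agent is never told that "right" is the correct move. It only receives a reward on reaching the goal, and the learned values should come to favor stepping right in every cell.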

Current definitions of general artificial intelligence emphasize two characteristics: end-to-end learning, and task adaptation, meaning the system can handle different tasks without humans stepping in to tune it. Deep reinforcement learning combines the perceptual ability of deep learning with the decision-making ability of reinforcement learning, so control is driven directly by the input information. This makes it an AI technology closer to the way humans think. Reinforcement learning uses rewards to learn by trial and error in the ordinary course of interacting with the world, which closely resembles natural learning. For example, a one-handed Rubik's Cube robot may need deep learning's image recognition to "see" the cube, and then a reinforcement learning model so that the hand can learn autonomously through continual trial and error. Reinforcement learning can work with less supervised training information; the advantage is that the agent draws richer information from its own experience and is not limited by the skill of a supervisor. Deep reinforcement learning is a further step toward building autonomous systems with a higher-level understanding of the world, which is why it is currently recognized as the existing technology most likely to achieve general artificial intelligence.

Future general artificial intelligence will depend on advances in brain science

"Although deep reinforcement learning is said to be the technology most likely to achieve general artificial intelligence, that does not mean it certainly will. We are still far from true general artificial intelligence," Hao Jianye said. When deep learning and reinforcement learning are combined, exhaustively enumerating real-world situations is replaced by first recognizing the situation and then enumerating over a limited set of patterns. This reduces the computational burden, but the data required is far greater than for other machine learning algorithms. If the setting is extended to multi-agent deep reinforcement learning, the required data and computing power grow exponentially. At present no platform can supply the massive data, covering all kinds of complex situations, that reinforcement learning requires, and in many real-world fields this data requirement cannot be met.

For example, reinforcement learning requires a great deal of trial and error. If a one-handed Rubik's Cube robot were put to work in a real cooking scenario, it might scatter ingredients all over the floor, pour a whole bag of salt into the pot, or even start a fire. Trial-and-error learning therefore cannot be carried out in such real-world scenarios.

In addition, deep reinforcement learning is among the hardest techniques in machine learning to tune successfully. Its success stories are actually few, although each one causes a sensation when it is released. Moreover, it is a modeling framework in which even the random seed can greatly affect the learning outcome: training the same model 10 times may fail 7 times and succeed 3 times. Deep reinforcement learning also overfits very easily to the environment the agent is currently interacting with, so if the environment changes slightly, an agent that looked good before is likely to make elementary mistakes.
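One common way to expose this seed sensitivity is to repeat the same training run under several seeds and report the fraction that succeed. The sketch below uses a deliberately tiny stand-in for a real deep reinforcement learning run: an epsilon-greedy learner on a two-armed bandit with a very small training budget. The arm probabilities, the budget and the function names are all hypothetical.

```python
import random

WIN_PROB = (0.4, 0.6)   # hypothetical arms; arm 1 is truly the better one


def estimates(counts, totals):
    """Average observed reward per arm (0.0 for an arm not yet tried)."""
    return [totals[i] / counts[i] if counts[i] else 0.0 for i in range(2)]


def train_bandit(seed, pulls=30, epsilon=0.1):
    """Run one short training session; return the arm the learner ends up preferring."""
    rng = random.Random(seed)
    counts, totals = [0, 0], [0.0, 0.0]
    for _ in range(pulls):
        means = estimates(counts, totals)
        # Explore at random with probability epsilon or on a tie, else exploit.
        if rng.random() < epsilon or means[0] == means[1]:
            arm = rng.randrange(2)
        else:
            arm = 0 if means[0] > means[1] else 1
        reward = 1.0 if rng.random() < WIN_PROB[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
    means = estimates(counts, totals)
    return 1 if means[1] > means[0] else 0


def success_rate(seeds):
    """Fraction of seeds for which training identified the better arm."""
    outcomes = [train_bandit(seed) == 1 for seed in seeds]
    return sum(outcomes) / len(outcomes)


if __name__ == "__main__":
    print(success_rate(range(10)))
```

Nothing differs between runs except the seed, so any spread in the results comes from randomness alone. Reporting the success rate over many seeds, rather than the single best run, gives a more honest picture of how reliable a training recipe is.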

"When humans come to understand things, they usually use data to make causal inferences and judgments and so arrive at solutions. Current artificial intelligence systems cannot yet perform this kind of causal inference," Hao Jianye said. Future general artificial intelligence may need to draw on how the human brain works, which in turn depends on progress in brain science. Our understanding of the human brain is still at a very early stage: how the brain perceives things, how it solves problems, and how it thinks remain unclear. Artificial intelligence therefore still has a long way to go before it reaches general artificial intelligence that can truly simulate human thinking.


Editor-in-Chief: Mary