手机扫码接着看

pokertrainer| Media industry: Multimodal GPT-4O release low latency in emotional understanding greatly improves ease of use

Author:editor|Category:Science

Core point of view: event: according to the OpenAI websitePokertrainerOpenAI released the multimodal model GPT-4o and the desktop version of GPT and ChatGPT new UI GPT-4o is an end-to-end multimodal model with significant improvements in non-English text (supporting 50 languages), visual and audio understanding in terms of model capability. Its text and graphics capabilities are now available in ChatGPT for free, and a new version of the voice model with GPT-4o will be launched in ChatGPT Plus in the next few weeks. In terms of cost, the speed of GPT-4o on API is 2 times higher than that of GPT-4Turbo, and the cost is reduced by half. Focus on the core highlights of GPT-4o: (1) multimodal capability to achieve text / audio / image combination input-output, audio and video capabilities significantly improved. (2) the ability to understand is outstanding, and can recognize human emotions and respond "emotionally". And the user can interrupt the model to be closer to the real human conversation scene. (3) low delay and real-time performance: the response time of audio input is only 232 milliseconds, with an average of 320 milliseconds, which is consistent with the response speed of human conversation. At the application level, the release of GPT-4o has greatly improved the ease of use of the large model, making AI assistants more naturally integrated into office / learning scenes and optimized, improving office efficiency and task execution. At the same time, its ability to understand and respond to human emotions will also bring new imagination space to AI emotional companionship, AI social and other races. According to the demonstration, GPT-4o can use the visual ability to identify the code and PDF on the screen in the desktop system, and make corresponding prompts or summaries; in the mobile phone system, it can recognize the emotions in the voice in the conversation, understand the human expression through video, and naturally make emotional changes in line with the scene dialogue, which is more "humanized". Investment advice:PokertrainerWe believe that GPT-4o breaks through many bottlenecks of previous large models in human-computer interaction, greatly improves the ease of use of large models, brings more possibilities for AI applications, further reduces costs, or will accelerate the prosperity of AI applications. GPT-4o 's ability to improve efficiency and entertainment to products are expected to bring product function and form breakthroughs, efficiency aspects, pay attention to the ability upgrading of AI office products, end-side AI intelligent assistant breakthrough; entertainment aspects, pay attention to AI emotional companionship, AI social products and other products after the "personification" and emotional attribute enhancement, the user experience is greatly improved. Continue to pay attention to the empowerment of AI to games, marketing, education, film and television and other industries. It is suggested to pay attention to: (1) Games: it is expected to further refine the game content and improve the production capacity of high-quality games, pay attention to Tencent Holdings, NetEase-S, Sanqi Mutual Entertainment, Kaiying Network, Perfect World, Shenzhou Taiyue, Giant Network, Shengtian Network, Yaoji Technology, Gigabit, Soul Network, wandering Network and so on. (2) Marketing: improve the efficiency and effect of advertising content generation, pay attention to the blue cursor with AI tool product layout, easy to click the world, focus media and so on. (3) Education: GPT-4o shows excellent ability in knowledge understanding and Q & A, and can recognize and understand codes and math problems through visual ability. There are many applications in the field of education, such as Jiafa Education, Video Source Co., Ltd., Century Tianhong, Southern Media and so on. (4) Film and television: it is expected to contribute to the industrialization of the film and television industry & high-quality, pay attention to Huatze Film and Television, Bona Film, Light Media, Lemeng Film and Television, etc. Risk hint: the iterative effect of the model is not as good as expected, the commercialization is not as expected, and the content ethical risk. [disclaimer] this article only represents the views of a third party and does not represent the position of Hexun. Investors operate accordingly, at their own risk.

pokertrainer| Media industry: Multimodal GPT-4O release low latency in emotional understanding greatly improves ease of use

[disclaimer] this article only represents the views of a third party and does not represent the position of Hexun. Investors operate accordingly, at their own risk.

14 05

2024-05-14 19:55:35

浏览30
Back to
Category
Back to
Homepage
bigbassbonanzafreespinsnodeposit| Bank of China and China Eastern Airlines sign strategic cooperation framework agreement randomrouletteonline| Jinke Services spent approximately HK$992,300 to repurchase 105,500 shares on May 14