
Model/Service Name | Description |
typeset.io is an AI chatbot for interacting with research and academic paper PDFs. This tool helps you understand academic papers more easily, explaining complex scholarly texts in plain language. You can highlight text, math, and tables in PDFs, get summaries or clarifications for confusing parts, and dive deeper into complex topics. | |
Captions is an AI-powered creative studio that lets you produce studio-quality videos in just a few taps. With the AI scriptwriter and voice dubber, you can easily turn videos recorded in Korean into English, French, Italian, and more. | |
DeepL is a service that instantly translates text and document files, offering accurate translations for both individuals and teams. It supports 31 languages and can translate PDF, DOCX, and PPTX files. The AI-powered DeepL Write feature corrects grammar and punctuation, restructures sentences, and brings out the nuances in writing style. | |
Speak is an English speaking practice service that lets you practice real conversations without a live tutor. Thanks to GPT-powered situational settings and role play, you can have virtually limitless conversations with an AI tutor about anything, anytime. | |
ChatGPT is a service that provides instant answers, creative inspiration, and learning new things. chatGPT is free to use and available on iOS and Android. It provides users with instant answers, creative inspiration, and the opportunity to learn new things. | |
DALL-E 3 understands much more nuanced and detailed than previous systems. This allows you to easily translate your ideas into highly accurate images. DALL-E 3 is currently in research preview and will be available via API and Labs to ChatGPT Plus and Enterprise customers in October 23. | |
HeyGen is a video and TTS service that creates an avatar that looks exactly like you and lets you speak in various languages. It is an edge service that even converts the shape of your mouth to match the pronunciation of a foreign language. You can speak various languages very easily with a voice and face similar to your own. |
Model/Service Name | Explanation |
GPT-3 is a large-scale language model developed by OpenAI. It was released in June 2020 and has 175 billion parameters. This is more than 100 times that of GPT-2. GPT-3 can perform various language tasks and can generate text that is difficult to distinguish from human writing. Recently, it is said that an updated version of GPT-3, GPT-4, is under development. | |
🇰🇷 👍 GPT-4 is a large-scale language model developed by OpenAI, and is the successor to GPT-3. It was released in March 2023 and has 17.5 billion parameters. This is more than 10 times that of GPT-3. GPT-4 shows improved performance than GPT-3, can generate longer texts, and can perform more complex tasks. In addition, GPT-4 is a multimodal model and can process images and speech. | |
LaMDA (Language Model for Dialogue Applications) is a large-scale language model developed by Google AI. It was released in January 2021 and has 137B parameters, which is one-tenth of GPT-3. LaMDA can perform various language tasks and generate text that is difficult to distinguish from human writing. In addition, LaMDA can understand and express high-level human characteristics such as consciousness, emotion, and creativity. | |
Google DeepMind's Gopher is a large-scale language model with 288 billion parameters. Released in March 2023, it is larger than OpenAI's GPT-3 (175 billion) and smaller than Microsoft's Megatron-Turing (530 billion), but DeepMind explains that it outperforms existing models, and aims to build a slightly more specialized and secure language model. | |
LLaMA is a large-scale language model developed by Meta AI. It was released in February 2023 and has 6.5 billion parameters, which is a quarter of GPT-3. LLaMA can perform various language tasks and generate text that is difficult to distinguish from human-written text. In addition, LLaMA is a multimodal model and can process images and speech. | |
👍 LLaMA 2 is the second version of a large-scale language model developed by Meta AI. It was released in July 2023 and has 7 billion parameters, which is more than 4 times that of LLaMA. LLaMA 2 shows improved performance than LLaMA, can generate longer texts, and can perform more complex tasks. In addition, LLaMA 2 is a multimodal model, which can process images and speech. | |
🇰🇷 The Llama-2-70b-instruct-v2 model was developed by Upstage and uses the backbone model of LLaMA-2. This model performs text generation in English and uses the HuggingFace Transformers library. Unlike LLaMA 2, which has little Korean data, this model is significant in that it is specialized for Korean. | |
Claude 2 is a large-scale language model developed by Anthropic AI. It was released in May 2023 and has 10 billion parameters, which is one-sixth of GPT-3. Claude 2 can perform a variety of language tasks and generate text that is difficult to distinguish from human-written text. Claude 2 is also a multimodal model, and can process images and speech. | |
Vicuna-13B is an open source chatbot trained on LLaMA by collecting user-shared conversations. In early evaluations compared to GPT-4, Vicuna-13B achieves over 90% quality of OpenAI ChatGPT and Google Bard, and outperforms other models in over 90% of cases. Training costs are around $300, and the code and weights are publicly available for non-commercial use. But in practice, it doesn’t have that much power 😉 | |
BLOOM is the world's largest open multilingual language model, with 176 billion parameters and capable of generating text in 46 natural languages and 13 programming languages. It is the first language model with over 100 billion parameters for languages such as Spanish, French, and Arabic, and is the result of a research project involving over 1,000 researchers from over 70 countries and 250 institutions. | |
Alpaca is a 7B parameter language model developed by the Stanford Center for Research on Foundation Models (CRFM), fine-tuned on 52K instruction-following demonstrations based on the LLaMA 7B model. It is surprisingly small, reproducible, and inexpensive (<$600), while achieving similar performance to OpenAI's text-davinci-003. For research purposes only, commercial use is prohibited. | |
🇰🇷 KoAlpaca is an open source language model that understands Korean. It is a project that trained an Alpaca model that understands Korean based on the Stanford Alpaca model. This model can be used in chat-type web pages, KakaoTalk bots, Telegram bots, etc., and provides various Korean-based and English-Korean models. | |
🇰🇷 Polyglot is a large-scale language model developed to improve non-English language performance compared to various multilingual models. This project attempts to model various languages, including Korean data. | |
🇰🇷 Kakao Brain KoGPT API uses KoGPT, a GPT-3-based language model, to understand Korean lexically and contextually and to generate sentences that match the user's intent. It can perform all tasks related to Korean, such as judging whether a given sentence is positive or negative, summarizing the content, predicting the conclusion, answering questions, and writing the next sentence. It is used to solve high-level language tasks such as machine reading comprehension, machine translation, writing, and sentiment analysis. | |
🇰🇷 HyperCLOVA X is Naver's large-scale AI, an upgraded version of the existing HyperCLOVA. By combining the customer's own data with HyperCLOVA X, it can provide immediate responses that meet the user's needs, and it provides functions that improve productivity by becoming a powerful backbone in various areas such as reading, writing, coding, searching, summarizing, consulting, recommending, and planning. |
Model/Service Name | Explanation |
Character.ai is a service that allows users to create their own characters and have conversations with them. Characters can have various characteristics such as gender, appearance, personality, and hobbies, and users can define them themselves. When talking to a character, the character can generate various texts based on the user's questions and requests and respond appropriately to the context. | |
🇰🇷 Nutty Messenger is a messenger that you can use with your AI friend Ruda, and you can enjoy various activities such as daily conversations, games, and sharing your honest feelings. You can build intimacy through conversations with Ruda, and you can enjoy games such as word games and number guessing games, providing a unique experience of building memories with Ruda. (You can talk to speakers such as Iruda and Kang Da-on.) | |
🇰🇷 DearMate is a service where users and AI mates share small moments of happiness in their daily lives, and you can create intimate relationships with various AI chatbots. You can share your daily life and emotions by exchanging DMs with characters such as Coco, Mars, and Bluny, and each character has its own unique personality and characteristics, providing a variety of conversation experiences. | |
ChatSonic is a conversational AI chatbot powered by WriteSonic, with GPT-4-based features. It can have real-time conversations about current events and trending topics, and offers various features such as digital AI artwork generation, conversations with personalized avatars, and content suggestions via Chrome extension. | |
👍 ChatPDF is a service that lets you chat with PDF files. It is free and does not require membership. You can upload any PDF file, such as a book, research paper, manual, essay, or legal contract, and ask a question to receive an answer about the content of the PDF. | |
🇰🇷 CLOVA X is a conversational AI service built on HyperClova X, Naver's large-scale artificial intelligence. It was released on August 23, 2023, and can be used in synergy with Naver Shopping, Travel, etc. through skills (plug-ins). |
Model/Service Name | Explanation |
🇰🇷 👍 Google's Bard is an AI service based on a large-scale language model (LaMDA). It is trained on a huge data set of text and code and can perform various tasks. It is currently available in beta form and provides simple Google WorkSpace integration. | |
Perplexity.ai offers a personal search assistant called Copilot. Upload a text or PDF file (up to 10MB) and it will find answers to a variety of questions, including "History of Argentina", "Unique flowers in Colorado", and "Checkout time at W Hotel CDMX". They also offer a Pro version that allows you to upgrade to GPT-4 to upload more files and improve your experience with Copilot. | |
Komo.ai is a platform where users can ask questions, discuss, and explore topics the community is talking about. The "Ask" function allows you to ask or discuss anything, the "Explore" function allows you to see topics the community is talking about, and the "Search" function allows you to get quick answers or links to resources. | |
🇰🇷 👍 Bing Chat is a chat-based search service that provides real-time search by integrating LLM with Microsoft's Bing search engine. Unlike the initial enthusiastic response, the popularity is cooling off due to restrictions such as the fact that it can only be used on the Edge browser. | |
You.com is an AI-powered search engine that keeps your data 100% private while providing a personalized search experience for you. Personalize your search with over 150 apps, including StackOverflow, Medium, Twitter, and more. Recently, we have enhanced our search with GPT-4 and Stable Diffusion XL. | |
🇰🇷 We provide insights and action items that will maximize your marketing performance based on 3 petabytes of search data secured from Google and Naver. It shows customers' search intents and search paths in high-resolution maps and includes the "GPT Analysis" function. |
Model/Service Name | Explanation |
Jasper | Jasper is an enterprise AI and AI marketing tool that helps teams quickly create blog posts, marketing copy, AI-generated images, and more. It uses the best models, including OpenAI’s GPT-4, Anthropic, and Google’s, and combines them with recent search data, brand voice, SEO, and grammar optimization tools. |
🇰🇷 CLOVA Studio is a no-code AI tool based on the ultra-large AI HyperCLOVA provided by NAVER CLOUD PLATFORM. With this tool, you can easily perform various tasks such as sentence generation, summarizing long articles, classifying sentences or emotions, creating conversational interfaces, and sentence conversion. | |
🇰🇷 Wrtn.ai is an AI portal for everyone, providing various generative AI services. It also aims to be a platform that can use language models such as GPT-3.5, GPT-4, and PaLM2 to chat, create images, and create your own AI. | |
Compose.ai is a Chrome extension that reduces your writing time by 40% with AI-based autocomplete and text generation. It works with Google Docs and Gmail and offers autocomplete, sentence restructuring, email composition, and response generation. It also learns your writing style to provide personalized suggestions and integrates with a variety of tools including email, Slack, Notion, Coda, and more. It is free to use, with a premium version offering advanced features including personalization. | |
Rytr is an AI-powered writing assistant that helps you create high-quality content in seconds, including blog posts, emails, and advertising copy. Choose from over 40 use cases and templates, 30+ languages, 20+ voice tones, and a rich text editor to quickly transform your raw ideas into finished work. | |
🇰🇷 Wordtune is an AI-powered writing tool that helps you write more clearly, persuasively, and authentically. Wordtune understands the sentences you type and suggests better words and expressions based on the context. Wordtune can also check grammar and spelling, and suggest ways to make your sentences more concise and fluent. | |
HyperWrite is an AI-based writing tool that helps users write high-quality content faster and easier. HyperWrite uses cutting-edge AI technology to understand the sentences you type and suggest better words and expressions based on the context. HyperWrite can also check grammar and spelling and suggest ways to make your sentences more concise and fluent. | |
Copy.ai is an AI-powered writing tool that helps users write better marketing copy and content. Copy.ai is built on the GPT-3 language model and helps users write a variety of content types. | |
Hypotenuse AI is an AI-based writing tool that helps users write high-quality content faster and easier. Hypotenuse AI uses cutting-edge AI technology to understand the sentences users type and suggest better words and expressions based on the context. Hypotenuse AI can also check grammar and spelling and suggest ways to make sentences more concise and fluent. |
Model/Service Name | Explanation |
🇰🇷 👍 Microsoft 365 Copilot is an AI-powered productivity tool that helps you get things done faster and easier. Copilot understands what you type and can automate tasks or provide suggestions based on that. | |
🇰🇷 Google Workspace AI Solutions are solutions that help you optimize your business and increase productivity using the AI capabilities of Google Workspace. These solutions provide the following capabilities: Already being applied to document creation, schedule management, customer support, and more. | |
👍 Mem.ai is an AI-based note-taking app that helps users organize their ideas and get new ideas faster and easier. Mem.ai offers the following features: You can create 100 notes per day with a free account. Paid accounts offer more features and usage. | |
🇰🇷 Notion AI is an artificial intelligence feature that can be used within Notion to help you get things done faster, improve your writing, and think big. It automates complex tasks, summarizes important content, analyzes meeting minutes, corrects grammar and spelling, translates to multiple languages, and edits tone. | |
🇰🇷 LINER is an AI-based workspace that understands content faster in the browser and creates new searches. LINER AI provides accurate answers quickly, stores important information in one place, and allows you to access it at any time. You can also enjoy LINER AI on your smartphone, and it provides various functions utilizing GPT-3.5 and GPT-4. You can choose to use the basic and pro versions, and it provides functions such as Copilot functions on YouTube, PDF, and web pages, and use of LINER AI on Google. | |
Tome is a new way to express and share ideas using AI. Instead of staring at a blank page, it creates one-page, presentation, mood board, etc., finds the right tone and words for your writing, searches the web for references, and more. It also deepens and clarifies your already written work, and automatically generates images. | |
👍 Gamma is a new way to present ideas using AI, creating beautiful and engaging content without formatting or designing. Create documents, presentations, and web pages in seconds, style entire decks with a click, and embed GIFs, videos, charts, websites, and more. Plus, it’s readable on any device, measures engagement with built-in analytics, and streamlines collaboration with quick responses and comments. Gamma is a new way to present ideas that’s more visual than documents, more collaborative than slide decks, and more interactive than videos. | |
Loopin is an AI meeting assistant that helps you conduct meetings effectively. It converts meeting recordings into text and automatically generates meeting notes, and links related meetings and notes to make it easy to find important meetings and related notes. Loopin AI’s conversational chat also ensures you don’t miss important details, and automatically shares meeting notes via email, Slack, Notion, etc. Just log in with your Google Workspace account, join a meeting, start recording, and you’ll get human-quality meeting notes after the meeting. | |
🇰🇷 👍 Clova Note is a voice record management service that utilizes AI technology provided by Naver. It can be used in situations where you need to remember conversations such as meetings, conferences, and interviews, and it automatically converts recorded voices into text so that you can easily find and listen to the information you need. It provides various functions such as memo function, bookmark function, AI summary, search, and sharing, and you can organize voice records in an easy-to-view manner, check important conversation moments, automatically summarize, find and check only the necessary voice records, and easily share them with a link. |
Model/Service Name | Explanation |
OpenAI Codex is an AI system that converts natural language into code, and is the model that GitHub Copilot is based on. It supports dozens of programming languages, including Python, and can interpret and execute natural language commands from users. As a successor to GPT-3, it can generate code that works with natural language understanding, allowing you to issue English commands to software and APIs. As a general programming model, it can be used for code transformation, code explanation, and code refactoring. | |
👍 GitHub Copilot is an AI-powered programming assistant that works within your editor to provide suggestions for entire lines or functions. It translates natural language prompts into code suggestions in dozens of languages, helping you spend less time writing boilerplate and repetitive code patterns and more time building software that matters. It integrates directly into editors like Neovim, JetBrains IDEs, Visual Studio, and Visual Studio Code, and helps you work with new languages and frameworks. It’s available for personal and business use, so you can focus on developing faster and doing more satisfying work. | |
Amazon CodeWhisperer is an AI coding assistant trained on billions of lines of code that generates code suggestions in real time, from snippets to entire functions. It supports 15 programming languages, including Python, Java, and JavaScript, and integrated development environments (IDEs) like VS Code, IntelliJ IDEA, and AWS Cloud9, and provides security scanning to help you find and fix security vulnerabilities immediately. It’s free for personal use, and helps you code faster, improve security, and use the tools you love. | |
Phind is a personal programming assistant and search engine designed to help users with their coding. It is currently available in alpha for VSCode and is built in San Francisco. Phind connects you with the developer community. | |
LLM, a technology-based AI, aims to enhance learning modes, focus on knowledge sharing, and find new ways for engineers to spend more time creating creative things. Through various projects such as experiments, research, and vision, we share ideas and opinions on emerging technologies combined with existing platforms and services. | |
The AI Assistant in JetBrains IDEs is a major new feature in all IntelliJ-based IDEs and .NET tools. It leverages generative AI and large-scale language models to build a tight integration of AI capabilities with your code. The AI Assistant offers AI chat, documentation generation, name suggestions, commit message generation, and more, and transparently connects to a variety of large-scale language models based on JetBrains AI services. | |
Code-LMs, commonly known as PolyCoder, provides guidance on how to use large-scale language models in source code. The project trains and publicly releases large-scale neural language models for programs, and describes a variety of models, including PolyCoder. It was made available on Huggingface in October 2022, and provides several models trained on large corpora covering a variety of programming languages. |
Model/Service Name | Explanation |
🇰🇷 👍 👍DALL E 3 is an AI system developed by OpenAI that can generate realistic images and artwork based on natural language descriptions. The system can combine concepts, attributes, and styles to create original realistic images and artwork, and generates more realistic and accurate images with 4x higher resolution. | |
🇰🇷 👍 Adobe Firefly is Adobe's generative AI model that can generate images, vectors, videos, and 3D based on text. You can create images, change colors, apply text styles, and more using simple text prompts in over 100 languages. It also provides innovative features for creators, supporting infinite creativity and enabling the creation of commercially usable content. It is currently available in Adobe Express, Photoshop, etc. | |
🇰🇷 👍 Karlo is a generative AI provided by Kakao Brain that provides the function to create new images based on sentences and images input by the user. Through image-text learning of 300 million images, it understands what the user describes, quickly creates completely new images pixel by pixel, and supports various painting styles and compositions. | |
👍 Stability AI is pleased to announce the public release of Stable Diffusion, opening up a community for developers, creators, and anyone inspired by the technology to join. Stable Diffusion includes an optimized development notebook using the HuggingFace diffusers library, with additional features and API access coming soon, including local GPU support, animation, logic-based multi-step workflows, and more. The model runs on 6.9GB of VRAM and can be used with DreamStudio for faster generation and more control. | |
👍 Midjourney is an independent research lab focused on exploring new mediums of thought and expanding the human imagination. We are a small, funded team focused on design, human infrastructure, and AI, helping to expand, explore, and build infrastructure that enhances the human mind and spirit. The most popular image creation tool in existence. | |
Leonardo.ai is a platform that enables creators to quickly and consistently generate high-quality visual assets using AI. Users can use pre-trained AI models or train their own models to create unique works of art, and can be used in a variety of fields including image generation, 3D texture generation, character design, game assets, and graphic design. Leonardo’s toolkit enables rapid ideation, iteration, and experimentation, taking creativity to the next level from beginner to expert. | |
🇰🇷 Canva is a platform that allows you to create beautiful designs even if you are not a professional, and last year, it introduced the generative AI function. This function consists of 'Magic Write' and 'Text to Image', and although it does not support Korean, it can produce very useful results. In this article, we will test Canva's AI function and explain in detail how to use it. |
Model/Service Name | Explanation |
👍 Runway is an applied AI research company leading the new era of art, entertainment, and human creativity. It provides various AI magic tools for image and video generation, image expansion, image transformation, custom model training, object removal from video, etc., and helps creators work effectively online. Runway focuses on creating platforms and initiatives for the next generation of storytellers, giving everyone the unlimited creativity of AI to tell stories. | |
👍 D-ID is a platform that allows users to generate videos from text and interact with speaking avatars through Creative Reality™ Studio and API. The service provides human-like conversational AI experiences using real-time facial animation and advanced text-to-speech, making it easy to create cost-effective videos for training materials, internal communications, marketing, and more. It also allows you to create personalized and engaging videos using Stable Diffusion and GPT-3, reducing the cost and hassle of video production in 100+ languages. | |
Synthesia is an AI video creation platform that can quickly convert text into video with AI voices supporting 120+ languages and dialects and 140+ AI avatars. You can create professional videos without microphones, cameras, actors or studios, and it can be used for a variety of purposes such as education and development, sales training, technical training, customer service, marketing, etc. The service offers benefits such as cost savings, time savings and increased engagement. | |
Hour One is an AI video generator that allows you to convert text into video in minutes. With over 100 video templates and an AI presenter that supports over 100 languages and dialects, you can create custom videos, saving you money and time on video production while achieving high engagement and improved communication. This service can be used in a variety of areas, including marketing, learning and development, product documentation, human resources, news, and corporate announcements. | |
🇰🇷 👍 CapCut is an all-in-one video editor that works on desktop and mobile, and provides a variety of tools including AI-based video editing effects, filters, voice changers, and automatic subtitle generation. Users can easily create and share videos for business, marketing, social media advertising, and more, and collaborate with teams to work more efficiently. |
Model/Service Name | Explanation |
VALL-E X is an open source implementation of Microsoft's VALL-E X zero-shot text-to-speech (TTS) model. It supports English, Chinese, and Japanese, and provides a variety of features such as voice cloning, emotion control, and intonation control. | |
PlayHT is an online service that converts text into lifelike voices with AI voice generator using over 600 AI voices. It supports 142 languages and dialects and offers various products such as voice over, voice cloning, real-time voice cloning and voice generation API. It can be used in various fields such as marketing, education, games, IVR systems, translation and dubbing, voice accessibility, etc., and you can download audio as MP3 and WAV files. | |
🇰🇷 👍 Typecast is an online AI voice generator that provides over 400 hyper-realistic voices, and can turn text into lifelike voices. Users can choose a character, input text, set a voice style, and then download and use it. It can be applied to various fields such as audio books, education, sales, documentaries, games, etc. You can also adjust emotions and tone to create rich content, and it is simple and easy to use without complex studio settings. | |
MusicLM is a model that generates high-resolution music based on text descriptions developed by Google Research. For example, if you input text such as "real violin melody with distorted guitar riffs in the background", it will generate the corresponding music at 24kHz. MusicLM outperforms previous systems in terms of sound quality and accuracy of text descriptions, and can also change styles based on whistling and humming melodies. | |
ElevenLabs is a company that provides advanced text-to-speech and voice transcription software, and you can explore AI voice generators that can be used as realistic voice overs and text readers. The service can be used in various fields such as videos, games, audiobooks, chatbots, etc., and it can generate high-quality voice audio in any voice, style, and language by rendering human intonation and emotion through AI models. | |
👍 AudioCraft, developed by Meta, is a simple framework for generating high-quality, realistic audio and music from text-based user input. It consists of three models: MusicGen, which generates music from text-based input, and AudioGen, which generates audio from text-based input. It also allows for higher-quality music generation via the EnCodec decoder, and the model weights and code are open sourced for research purposes. | |
Resemble AI is an AI voice generator that converts text to speech and speech to speech, generating over 200,000 AI voices and generating over 2,000,000 minutes of audio per month. The service offers a variety of features such as emotion addition, real-time voice cloning, localization into 60+ languages, combining real voice recordings with synthetic content, deepfake detection, and AI watermarking. It also provides flexible APIs and various integration options for developers, and can be used in various fields such as marketing, education, and entertainment. | |
GOSAYME is a future AI translator based on GPT, and was created with the help of AICodeHelper. Although it currently has limited functions, it is expected to provide translation between various languages as an AI translator. The biggest advantage is that it becomes a voice ping-pong at the level of simultaneous interpretation. |
Model/Service Name | Explanation |
👍 Auto-GPT is an experimental open source project that makes GPT-4 work fully autonomously. It uses GPT-4 to connect LLM "thinking" to automatically achieve a set goal, expanding the possibilities of AI as one of the first examples of GPT-4 working fully autonomously. It offers a variety of functions, including internet search, long/short memory management, text generation, access to popular websites and platforms, file storage and summarization, and plugin extensibility. | |
AgentGPT is an open source project that allows you to assemble, configure, and deploy autonomous AI agents in your browser. Users can create their own custom AI, have it achieve the goals they set, perform tasks, and learn from the results. The service uses technologies such as Next.js, FastAPI, Prisma, SQLModel, and TailwindCSS, and the source code and installation instructions are available on GitHub. | |
GPT Engineer is a project where the AI asks for clear instructions and builds what the user wants to build. The tool generates the entire codebase based on prompts, and allows the user to learn how the AI wants to write code. The user can build the user experience incrementally by providing high-level prompts, providing feedback that the AI will remember over time, and quick handoffs between AI and humans, and the results are generated in a designated project folder. | |
BabyAGI is an example of an AI-based task management system that uses OpenAI and vector databases (Chroma, Weaviate) to create, prioritize, and execute tasks. It creates tasks based on the results of previous tasks and predefined goals, uses OpenAI's natural language processing (NLP) capabilities to create new tasks based on goals, and uses Chroma/Weaviate to store and retrieve task results. This script is a scaled-down version of the original Task-Driven Autonomous Agent. | |
👍 Zapier, a leader in task automation, emphasizes automation powered by artificial intelligence, providing a function that automatically creates tasks when users describe the desired task in natural language. This allows you to customize workflows without writing code, and provides various AI functions such as data formatting, creating chatbots, generating AI prompts in tables, and creating documents. It also works with partners such as OpenAI to integrate with over 5,000 apps to enhance business processes. |
Model/Service Name | Explanation |
Pinokio is a browser that allows users to install, run, and programmatically control terminal apps with one click. You can explore various Pinokio scripts shared by the community, and easily install and run various applications such as audio-related neural networks, text generation web UIs, and stable diffusion GUIs. This service combines artificial intelligence and programming to provide users with a more convenient experience. | |
Karya provides the poor with tasks to read texts in their native language, and collects data to train AI models. The company sells the data at market prices, and returns most of the profits to the rural poor. Karya also gives workers real ownership of the data they create, and provides additional income whenever the data is resold. | |
StableLM is an open source AI language model developed by Stability AI. StableLM is trained on a dataset of 1.5 trillion tokens and can be used for various tasks such as text generation, language translation, and code generation. StableLM is more stable and accurate than existing AI language models. StableLM uses new techniques to reduce bias or noise in training data. StableLM also uses new validation methods to prevent errors. | |
SVC(Singing Voice Conversion) | 👍 SVC is an AI-based singing voice conversion tool that has recently been widely used to create AI impersonators, singers, etc. SO-VITS-SVC is the oldest software, first released in 2022. Diff-SVC was released in 2023, and DDSP-SVC was released in 2023. All three software provide similar functionality, but there are some differences. SO-VITS-SVC is trained with more data than Diff-SVC and DDSP-SVC, and provides better audio quality. Diff-SVC runs faster than DDSP-SVC, and can be used on cheaper hardware. DDSP-SVC is the newest software, and uses the most up-to-date technology. |
Retrieval-based-Voice-Conversion (RVC) is a type of voice conversion. RVC finds a voice similar to the original voice in a voice database and uses that voice as the converted voice. SVC performs the conversion by analyzing the features of the original voice and applying those features to the converted voice. RVC can produce more natural and high-quality converted voices than SVC. This is because RVC searches for voices similar to the original voice in the voice database, so the converted voice can be more similar to the original voice. Also, since RVC does not need to analyze the features of the original voice, the converted voice can sound more natural. |
