Skip to main content

šŸŸ¢ 2023: The Year of AI

Reprinted with citation; will remove if copyright issues arise. Original source

AI has undoubtedly made waves in 2023 and here we spotlight the most significant stories of the year poised to shape the future of this groundbreaking industry:

Correction: In the original blog post published on December 22, 2023, the title ā€œAI Releasesā€ caused confusion as the content encompassed announcements and updates in addition to releases. We clarified the title of the text and infographic. The mention of Stability AI open-sourcing its LLM was excluded from the infographic but left in the article, underscoring its significance in promoting accessibility rather than focusing on tech improvement. The infographic initially featured the establishment of the xAI startup, now removed because of irrelevance. Additionally, the mention of Apple Vision Pro was excluded as the article focuses on software. We also included Midjourney V.6 in the list as it is a very recent release. These adjustments aim to improve accuracy and coherence. We apologize for any confusion and appreciate your understanding!

AI Advancementsā€‹

In the landscape of AI advancements this year, notable progress was made, refining existing technologies rather than introducing groundbreaking innovations akin to theĀ ChatGPT or image generators of the previous year. While there was no wow effect and the real Artificial General Intelligence (AGI) is still far away, this year marked an intermediate stage between prior breakthroughs and something even more powerful to come. To showcase this evolution, we crafted a visual timeline, highlighting the most remarkable AI advancements that have shaped this year of AI:

2023: The Year of AI

Image Generation

  • Adobe Firefly:Ā Adobeā€™s FireflyĀ andĀ Generative FillĀ empowered diverse visual content creation, including illustrations, art concepts, and photo manipulation.Ā Integrated into Photoshop, Adobe Firefly democratized AI, extending its power to a broad user base at once. The release of theĀ Text Effect featureĀ also marked a significant stride, allowing users to apply styles or textures to words and phrases.
  • Midjourney: Midjourneyā€™s V.5 model marked a milestone in image generation, showcasing improved efficiency, coherence, and higher resolution. The latest alpha-version, Midjourney V.6, brought additional enhancements such as more accurate prompt following, increased model knowledge, and minor text drawing ability.
  • DALLĀ·E 3:Ā Built on ChatGPT,Ā DALLĀ·E 3Ā simplified image generation, eliminating the need for complex prompt engineering. In addition, ChatGPT introduced a feature to help users refine prompts and make image adjustments based on feedback.
  • Shutterstock.AI:Ā The stock image giantĀ integrated AI capabilities, allowing users to transform prompts into license-ready imagery. Recognizing and rewarding contributing artists, Shutterstock made the first step in ethical AI.

2023: The Year of AI

The Evolution of Text-to-Image Algorithms, 2007 vs 2023

Video Generation

  • Stability AI:Ā Stability AIĀ introduced Stable Video Diffusion, a groundbreaking model for generative video, with open-source access on GitHub. Drawing a parallel toĀ AI image generation trends, itā€™s highly possible that the Stable Video Diffusion model will play a pivotal role in the creation of a significant portion of AI-generated videos.
  • HeyGen:Ā AI startup unveiledĀ a tool for voice cloning, lip movement adjustments, and language translation in videos.
  • Runway Gen-2:Ā Runway launched the Gen-2Ā model, enabling users to effortlessly generate full-blown videos from just text prompts, images, or other videos. Just have a look at the example below.Ā 
  • Pika and Pika 1.0: With its initial release, Pika garnered half a million users, generating millions of videos weekly. Then upgraded AI model inĀ Pika 1.0Ā empowered users to create and edit videos in various styles, including 3D animation, anime, cartoon, and cinematic.
  • Codec avatars by Meta:Ā Metaā€™s Pixel Codec AvatarsĀ (PiCA) model for 3D human faces in videos brought us closer to photorealistic telepresence.

Text Generation

  • Bard and Gemini:Ā Googleā€™s BardĀ added human-like emotion and sentiment to the chatbot landscape. Introduced into Bard chatbot and trained on a multimodal dataset,Ā Googleā€™s GeminiĀ emerged as the ā€œmost capableā€ AI model and the closest competitor to OpenAIā€™s ChatGPT.
  • Grok:Ā Elon Muskā€™s startup xAIĀ signaled a commitment to AI development, potentially competing with OpenAI, byĀ unveiling ā€œGrokā€Ā ā€” a chatbot with humor, rebelliousness, and real-time knowledge via the š• platform. The xAI promised that Grok was designed to answer provocative questions rejected by other AI systems.
  • OverflowAI:Ā Stack Overflowā€™s OverflowAIĀ enhanced knowledge curation, enabling AI-powered search for relevant answers in Visual Studio Code and Slack.
  • Llama 2:Ā Meta released Llama 2, the next generation of its open-source large language model, showcasing enhanced efficiency. Metaā€™s fine-tuned LLM was also optimized for dialogue use cases and outperformed other open-source models on most benchmarks.
  • GPT-4:Ā OpenAIā€™s GPT-4Ā now handles image input, generates captions, classifications, hears, and responds in a back-and-forth conversation, and supportsĀ real-time web browsing. OpenAI also extended support for plugins, fostering a landscape enriched with open-source competitors. GPT-4 is the next step in OpenAIā€™s journey to develop AGI.
  • Mistral 7B:Ā Mistral AI,Ā valued at around $2 billionĀ this year, released Mistral 7B, a large language model challenging GPT-4 and Claude 2. Emphasizing an open technology approach, Mistral AI offered its model for free download.
  • Mixtral 8x7B:Ā Mistral AI also introduced Mixtral 8x7B, a high-quality sparse mixture of expert model (SMoE) with open weights, featuring 46.7B total parameters, pioneering openness in models with enhanced truthfulness and reduced biases.
  • Yi-34B llm:Ā Valued at $1 billionĀ this year, Kai-Fu Leeā€™s startupĀ 01.AIĀ released Yi-34B ā€” an open-source neural network that outperformed competing models with significantly higher parameter counts, emphasizing its cost-efficiency.

Other Advancements:

  • Segment Anything Model (SAM):Ā Meta AI presented SAM, a segmentation model capable of ā€œcutting outā€ objects in images without additional training, underscoring its adaptability. SAM was trained on a vast dataset, showcasing its robust performance in object segmentation.
  • Direct Preference Optimization (DPO):Ā DPO emergedĀ as a stable and efficient method for fine-tuning large-scale unsupervised language models and teaching text-to-image models. It achieved precise control without complex reinforcement learning from human feedback (RLHF).
  • Zephyr Direct Distillation of LM Alignment:Ā Zephyr-7B, a result of distilled direct preference optimization (dDPO), set the benchmark for chat models with 7B parameters, enhancing intent alignment without extensive training.
  • Autonomous AI Agents:Ā Autonomous AI agents emergedĀ as a notable trend, showcasing a transformative shift toward advanced and autonomous AI systems. AI Agents are considered a first glimpse of AGI as they can generate self-directed tasks and instructions based on a userā€™s goal, and work on them autonomously until the goal is achieved.
  • EvoDiff:Ā Microsoftā€™s EvoDiff, an open-source AI framework for fast and cost-saving protein generation, promised advancements in therapeutics and industrial applications.
  • Stable Audio:Ā Stability AI launchedĀ a tool for generating short high-quality audio clips from simple text prompts.
  • GPT Store, Copyright Shield, ChatGPT Bot Constructor:Ā OpenAI introducedĀ the GPT Store to sell custom GPT bots, Copyright Shield to cover legal costs related to copyright infringement claims, and a no-code platform for custom ChatGPT versions.
  • Stability AI Open-Sourced its LLM:Ā Stability AI has open-sourced its models, StableLM-Alpha and Stable Vicuna, renowned for their impressive performance in generating text and code. Stable Vicuna is the first open-source chatbot trained using reinforcement learning from human feedback (RLHF). Furthermore, Stability AIĀ unveiled SDXL Turbo, a real-time text-to-image generation model.

Partnershipsā€‹

In the dynamic realm of 2023, significant collaborations have surfaced among industry leaders, shaping the trajectory of the future. Here are the top merges and partnerships that were defining the AI landscape in this year 2023:

Stability AI and Init ML

Stability AI has made a significant move byĀ acquiring Init ML, the brains behind the popular editing app ClipDrop. The objective was clear: integrate Stability AIā€™s advanced technologies into ClipDropā€™s ecosystem. The collaboration has already resulted in the development of SDXL Turbo.

Runway and GettyĀ Images

Runway has joined forces with Getty ImagesĀ in a strategic partnership to introduce a new video generation model RGM (The Runway and Getty Images Model). The model combines Runwayā€™s AI capabilities with Getty Imagesā€™ licensed creative content library. The collaboration aims to revolutionize content creation workflows, enabling companies to generate high-quality, customized videos tailored to their brand identities.

Snowflake and Neeva

Snowflake, a major player in the data warehouse platform,Ā has acquired Neeva, a startup known for using generative AI to enhance the search experience. Neeva had recently closed its subscription-based, ad-free search engine. The founders of Neeva also acknowledged the challenge of convincing users to try a new search engine.

Shutterstock and OpenAI

Shutterstock and OpenAI have committedĀ to an extended 6-year partnership. OpenAI gained access to high-quality data from Shutterstock, enriching its model training datasets with a diverse range of images, videos, and music libraries. Shutterstock continued to leverage OpenAIā€™s technologies, leading to the launch of Shutterstockā€™s AI image-generating tool.

In the ever-evolving legal realm of AI, 2023 finds itself amidst a landscape filled with uncertainties and ongoing debates. As new challenges emerge, discussions surrounding copyright, corporate policies, and the broader regulatory framework continue, shaping the contours of AIā€™s legal landscape. Here are the most important legal issues of the year 2023:

European AI Act

TheĀ European Union introduced the AI Act, the worldā€™s first comprehensive law, to regulate the use of AI. The act classifies AI systems based on the risk they pose and sets forth regulations accordingly. Although the AI Act has been provisionally agreed upon, its implementation faces delays, and the enforcement wonā€™t commence until 2025.

U.S. Copyright Office Stance on Registration of AI-Generated Content

The U.S. Copyright Office took a decisive stance,Ā denying copyrightĀ registration for images created by the AI algorithm Midjourney. The rejection set a precedent, asserting that AI artworks solely created by AI, without human involvement, are ineligible for copyright protection. In the same vein, theĀ U.S. Copyright Office issued guidanceĀ on AI-assisted works, clarifying that works created by humans using AI tools may be eligible for copyright protection. The guidance confirmed that works created by humans using AI tools should be evaluated based on whether the human role in the creation of those works was determinative.

ā€œCurrently, the existing legal system is not prepared to acknowledge copyright for works created with AI, given that AI learns from existing data, the rights to which belong to other people, challenging the attribution of ownership. The practice for addressing this issue is expected to develop next year, facilitated by public participation through state-conducted surveys. Resolving this matter independently is now difficult without broader public engagement.ā€

Daria Kuznetsova, Corporate Lawyer of Everypixel

McKinseyĀ also released a comprehensive graph capturing the most important AI governance-related policy and regulatory efforts in 2023. The visual representation highlights the significant contributions of 2023 in shaping the legal landscape of AI.

2023: The Year of AI

Source:Ā McKinsey

Debatesā€‹

The year 2023 was abuzz with intriguing debates and discussions, grappling with uncertainties and the evolving norms of the AI landscape. As the industry shapes its course, these debates become inevitable, promising more thought-provoking dialogues and challenges on the horizon. Here are some of the most noteworthy debates that defined the year:

Corporate Restrictions on ChatGPT

Major financial institutions, including JP Morgan, Citigroup, Bank of America, Deutsche Bank, Goldman Sachs, and Wells Fargo & Co,Ā have restricted ChatGPT usageĀ due to security and privacy concerns. This reflected a broader trend where companies were issuing warnings to employees about the legal considerations associated with AI applications in corporate environments.

OpenAIā€™s Use of Low-Paid Workers

Timeā€™s investigation exposed OpenAIā€™s collaboration with Sama,Ā employing low-paid workers in KenyaĀ to sift through sensitive content for ChatGPT. The revelation raised ethical questions about the treatment of workers and the impact of content moderation on mental well-being.

Leadership Transition at OpenAI

Sam Altmanā€™s departureĀ and quick return made headlines last month. A leadership transition unfolded at OpenAI as Sam Altman stepped down amid communication inconsistencies with the board. Interim CEO Mira Murati, along with a majority of staff, advocated for Altmanā€™s return. This unprecedented situation attracted widespread attention, leaving questions about the true reasons behind the transition and future implications.

Adobe and Figma

Adobeā€™s $20 billion acquisition plan for FigmaĀ encountered regulatory hurdles, prompting investigations by the European Commission and the UK Competition and Markets Authority over potential antitrust issues. The proposed dealā€™s impact also extended beyond design considerations, as Adobeā€™s dominance in customer data platforms raised concerns among Chief Information Officers (CIOs) about its potential influence on cloud software spending. However,Ā Adobe abandoned the dealĀ due to challenges in securing antitrust approvals in Europe and the UK, resulting in a termination fee of $1 billion to Figma.

Photographer Hacked the World Photography Awards

Photographer Boris EldagsenĀ disrupted the Sony World Photography AwardsĀ by submitting AI-generated artwork. Eldagsenā€™s refusal to accept the prize sparked a debate on the place of AI-generated images in traditional photography competitions, challenging perceptions of authenticity and creativity.