이 가이드에서는 Gemini 사고 모델의 몇 가지 사용 사례를 간략히 설명합니다. 이러한 모델을 사용하면 개발자가 복잡한 문제를 더 작고 관리하기 쉬운 단계로 분해하여 해결할 수 있습니다. 이 단계별 접근 방식은 어려운 작업의 성능을 향상시킬 뿐만 아니라 개발자에게 Gemini의 추론 프로세스에 관한 유용한 정보를 제공하며, 이는 Google AI 스튜디오 UI에 표시됩니다. 현재 사고 기능이 있는 모델에는 Gemini 2.0 Flash Thinking 및 실험용 Gemini 2.5 Pro Experimental with thinking이 있습니다.
프롬프트 기법
효과적인 프롬프트는 Gemini 사고 모델의 이점을 극대화하는 데 중요합니다. 다음과 같은 기법을 고려해 보세요.
시작하기
원하는 결과를 설명하는 일반적인 안내로 시작합니다. 간단한 프롬프트로 시작하여 문제를 해결하는 방법에 관한 모델의 초기 생각을 관찰합니다.
Describe the most unusual or striking architectural feature you've ever seen in a building or structure. <...> The most unusual architectural feature that has truly stayed with me is the "floating roof" of the Heydar Aliyev Center in Baku, Azerbaijan, designed by Zaha Hadid Architects. Visually, it's breathtaking. Imagine a vast, undulating landscape of pure white, seemingly draped over the building. It's not just a roof in the traditional sense; it's an extension of the ground, flowing seamlessly upwards and over the building's various spaces before curving back down to meet the earth again. There are no sharp angles, no rigid lines. Instead, everything is fluid, organic, and continuous. It resembles a giant, pristine white wave frozen in motion, or perhaps a sand dune sculpted by an unseen wind. <...> |
단계별 안내 제공
첫 번째 시도에서 원하는 결과를 얻지 못한 경우 더 구체적인 단계별 안내를 제공하여 Gemini가 문제 해결 과정을 통해 더 집중적인 접근을 할 수 있도록 프롬프트를 수정하세요. 이렇게 하면 명시적인 제안과 안내를 통해 모델의 태스크에 대한 초기 이해를 유도할 수 있습니다.
Now imagine that architectural feature is not just for looks, but has a specific function or purpose related to the building's environment or the people who use it. Explain what that function could be and how it enhances the building's overall design and usability. <...> Let's imagine that the "floating roof" of the Heydar Aliyev Center is not purely aesthetic but serves a crucial environmental and user-centric function: it's designed as a sophisticated climate moderator and natural ventilation system for the Baku environment. Here's how this function would work and enhance the building: Function: Passive Climate Control and Natural Ventilation System Shading and Solar Heat Mitigation: The undulating curves of the white GFRC roof are strategically shaped and oriented to provide optimal shading throughout the day, year-round. Baku experiences intense sun, especially during the summer. The overhanging curves, acting like giant brise-soleils, would block direct sunlight from hitting the building's glass facade, significantly reducing solar heat gain. The varying heights and depths of the curves would create dynamic shade patterns, ensuring that different parts of the building are shaded at different times of the day, optimizing comfort and reducing the need for excessive air conditioning. The white color of the GFRC further enhances this by reflecting a large portion of the solar radiation. <...> |
사고를 통한 멀티샷 프롬프트
사고와 함께 멀티샷 프롬프트를 사용하면 Gemini의 이해를 더욱 향상시키고 응답의 정확성을 개선할 수 있습니다. 프롬프트에서 원하는 동작과 출력 형식을 보여주는 입력-출력 쌍의 몇 가지 예시를 제공합니다.
Example 1: User: What is the tallest mountain in the world? Assistant: Mount Everest Example 2: User: What is the largest ocean? Assistant: Pacific Ocean User: What is the longest river in the world? Assistant: Amazon River |
출력 및 동작 정의
사용자가 모델과 직접 상호작용하는 애플리케이션을 빌드할 때는 Gemini의 출력 및 응답이 어떻게 들리고 형식이 지정되는지에 관한 안내를 제공하는 것이 좋습니다.
시스템 안내
시스템 안내는 모델에서 프롬프트를 처리하기 전에 처리하는 안내 모음입니다. 모델에 프롬프트가 표시될 때마다 호출되며 모델의 동작 및 응답 방식을 지정합니다. 예를 들어 역할 또는 캐릭터, 문맥 정보, 형식 지정 안내 등을 포함할 수 있습니다.
System Instruction: You are a helpful and concise coding tutor. Always provide code examples and explain them briefly. User Prompt: Explain what is a closure in Javascript and give an example. function outerFunction(outerVar) { function innerFunction(innerVar) { console.log(outerVar); // Accessing outerVar from outerFunction's scope console.log(innerVar); } return innerFunction; } const closureExample = outerFunction('I am from outer'); closureExample('I am from inner'); Explanation: A closure is created when a function (innerFunction in this example) is defined inside another function (outerFunction) and the inner function remembers and has access to the variables from the outer function's scope, even after the outer function has finished executing. In the example: outerFunction is called with 'I am from outer' and returns innerFunction. closureExample now holds a reference to innerFunction. When closureExample('I am from inner') is called, innerFunction executes. Even though outerFunction has already finished, innerFunction can still access and use outerVar (which is 'I am from outer') because of the closure. In essence, the innerFunction "closes over" the variables of its outer function's scope, creating a closure. |
확인 및 반성
시스템 안내 또는 프롬프트에 직접 특정 인증 단계를 포함합니다. 예를 들어 Gemini에게 출처를 확인하고, 추론 과정을 검토하고, 잠재적인 오류를 파악하고, 최종 답변을 확인해 달라고 요청할 수 있습니다.
Verify the following statement and provide your sources: The capital of Brazil is Rio de Janeiro. The statement "The capital of Brazil is Rio de Janeiro" is incorrect. The current capital of Brazil is Brasília. Rio de Janeiro was the capital of Brazil for a significant period, from 1763 to 1960. However, the capital was moved to Brasília in 1960 as part of a plan to develop the country's interior and create a more centrally located capital. Verification and Sources: You can easily verify this information from numerous reliable sources: Encyclopedias: Britannica: Searching for "Brazil" on Britannica will clearly state Brasília as the capital. https://www.britannica.com/place/Brazil (Look for the "Capital" information in the introductory summary or "Geography" section) Wikipedia: The Wikipedia article for "Brazil" also states Brasília as the capital. https://en.wikipedia.org/wiki/Brazil (Look in the infobox or introductory section) Government and Official Sources: CIA World Factbook: This is a highly reliable source for country information. Searching for "Brazil" will list Brasília as the capital. https://www.cia.gov/the-world-factbook/countries/brazil/ (Look under "Government" > "Capital") Official Website of the Brazilian Government (if accessible and in English): While navigating official government websites can sometimes be complex, they would definitively list Brasília as the capital. Geographic Resources: National Geographic: Their website and publications will confirm Brasília as the capital of Brazil. Atlases and Geography Textbooks: Any modern atlas or geography textbook will list Brasília as the capital. Why the confusion? The confusion might arise because: Rio de Janeiro is a very famous Brazilian city: It's a major tourist destination, known for its beaches, Carnival, and iconic landmarks like Christ the Redeemer. Its past status as capital adds to its historical significance and public recognition. Historical context: Rio de Janeiro was the capital for a long time. People who learned about Brazil in the past might remember it as the capital. In conclusion, the statement is false. The capital of Brazil is Brasília, not Rio de Janeiro. Rio de Janeiro was the former capital but is no longer the current capital. You can confidently use the sources listed above to verify this information. |
생각을 통한 도구 활용
Gemini Thinking 모델은 도구를 활용하여 텍스트 생성 외의 작업을 실행할 수 있습니다. 이를 통해 외부 시스템과 상호작용하거나, 코드를 실행하거나, 실시간 정보에 액세스하여 결과를 추론 및 최종 응답에 통합할 수 있습니다. API 구성을 통해 도구를 사용 설정할 수 있습니다.
검색 도구
검색 도구를 사용하면 모델이 외부 검색엔진을 쿼리하여 최신 정보 또는 학습 데이터 외의 정보를 찾을 수 있습니다. 최근 사건이나 매우 구체적인 주제에 관한 질문에 유용합니다.
What were the major scientific breakthroughs announced last week? Based on recent search results, here are some highlights from the past week in science: * **[Specific Breakthrough 1]:** [Brief description based on search results, e.g., A new study published in Nature details advancements in battery technology...] * **[Specific Breakthrough 2]:** [Brief description based on search results, e.g., Researchers at MIT announced progress in understanding dark matter...] * **[Specific Breakthrough 3]:** [Brief description based on search results, e.g., The James Webb Space Telescope captured new images revealing...] (Note: The model would use the search tool to find relevant news and articles from the specified timeframe and synthesize the information. Grounding metadata citing the sources would also typically be available via the API.) |
코드 실행 도구
코드 실행 도구를 사용하면 모델이 코드를 작성하고 실행할 수 있습니다 (예: Python)을 사용하여 계산을 실행하거나, 데이터를 조작하거나, 알고리즘으로 가장 잘 처리되는 문제를 해결합니다. 모델은 코드의 출력을 수신하고 응답에 이를 사용할 수 있습니다.
What is the sum of the first 50 prime numbers? Generate and run Python code for the calculation, and make sure you get all 50. Provide the final sum clearly. Okay, I can calculate that. I will generate Python code to find the first 50 prime numbers and sum them up. ```python def is_prime(num): if num < 2: return False for i in range(2, int(num**0.5) + 1): if num % i == 0: return False return True primes = [] num = 2 while len(primes) < 50: if is_prime(num): primes.append(num) num += 1 total_sum = sum(primes) print(total_sum) ``` Running the code... Code Execution Result: Outcome: OK Output: 5117 Use code with caution. The sum of the first 50 prime numbers is 5117. |
구조화된 출력
JSON과 같은 특정 구조에 따라 출력 형식을 지정하도록 모델에 지시할 수 있습니다. 이는 특히 모델의 출력을 애플리케이션에 통합하는 데 유용합니다. 프롬프트에서 원하는 형식과 스키마를 직접 지정하거나 `response_mime_type` 과 같은 API 매개변수를 사용하여 더 강력하게 제어할 수 있습니다(가능한 경우).
Provide a list of 3 famous physicists and their key contributions in JSON format. Use this JSON schema: Physicist = {'name': str, 'contribution': str, 'era': str} Return: list[Physicist] ```json [ { "name": "Albert Einstein", "contribution": "Theory of Relativity (Special and General)", "era": "Early 20th Century" }, { "name": "Isaac Newton", "contribution": "Laws of Motion and Universal Gravitation", "era": "Late 17th / Early 18th Century" }, { "name": "Marie Curie", "contribution": "Pioneering research on radioactivity", "era": "Late 19th / Early 20th Century" } ] ``` |
디버깅 및 스티어링
Google AI Studio는 개발자가 UI에서 모델의 생각에 액세스할 수 있도록 지원합니다. 이는 개발자가 디버깅할 때 프롬프트를 검토하는 데 유용합니다.
사고 과정 분석
Gemini의 사고 과정을 주의 깊게 검토하여 잘못된 부분을 파악합니다. 이 분석은 프롬프트를 미세 조정하고 애플리케이션에서 일관되게 예상되는 응답을 얻는 데 유용한 단서를 제공할 수 있습니다.
예시 시나리오: Gemini의 사고 과정을 통해 복잡한 안내의 한 단계를 잘못 이해한 것으로 나타나면 프롬프트를 수정하여 해당 단계를 명확히 할 수 있습니다.
Step 1: Summarize the following article about quantum physics. Step 2: Translate the summary into Spanish. Step 3: Extract the key entities from the Spanish summary. Article: ... [Article Text] ... The user wants me to process an article about Google's new quantum chip called Willow in three steps: Step 1: Summarize the article. Step 2: Translate the summary into Spanish. Step 3: Extract the key entities from the Spanish summary. Let's break down each step. Step 1: Summarize the article. I need to read the article and identify the main points. The article is about Google's new quantum chip, Willow, and its achievements. The key achievements are: Exponential error reduction as qubits scale up. Outperforming supercomputers in a benchmark computation (RCS). Willow is a step towards commercially relevant quantum computing. It's the first system below threshold for quantum error correction. It achieved a benchmark computation in under 5 minutes that would take a supercomputer 10 septillion years. It was fabricated in a new facility. It has state-of-the-art performance and high quality qubits. Google is looking towards real-world applications with Willow. Quantum computing will benefit AI. I will synthesize these points into a concise summary. Step 2: Translate the summary into Spanish. Once I have the summary in English, I will translate it into Spanish. I need to ensure the translation is accurate and natural-sounding in Spanish. Step 3: Extract the key entities from the Spanish summary. After translating the summary, I will identify and extract the key entities. Key entities are typically nouns or noun phrases that represent important objects, concepts, or actors. In this context, they will likely include: Names of chips (Willow, Sycamore) Companies/Organizations (Google Quantum AI, Google Research) Technologies (quantum chip, quantum computer, qubits, quantum error correction, superconducting quantum system, classical computers, supercomputers, AI) Benchmarks (random circuit sampling, RCS) People (Julian Kelly, Peter Shor, David Deutsch, Sergio Boixo, Hartmut Neven, John Preskill) Concepts (exponential error reduction, below threshold, beyond breakeven, state-of-the-art performance, commercially relevant applications, quantum mechanics, multiverse, T1 times, system benchmarks, algorithmic benchmarks) I will need to carefully review the Spanish summary to identify these entities and list them out. |
Provide Corrective Feedback
If you observe errors or inconsistencies in Gemini's reasoning, provide corrective feedback in your prompt and/or System Instructions to anticipate where the model's response may deviate from your desired outcome.
You are an expert in botany. When asked to identify a plant, focus on leaf shape, flower structure, and stem characteristics. Identify this plant based on these characteristics: ... [Plant Characteristics Description] ... Previous incorrect response focused on habitat only. Please disregard habitat and focus on morphology. Previous incorrect response focused on habitat only. Please disregard habitat and focus on morphology. Tabebuia vellosoi |
긴 형식의 결과물과 긴 형식의 사고를 최대한 활용하기
사고는 Gemini의 자세한 추론을 보여주는 토큰 출력의 일부로 '생각'을 생성합니다. 이러한 출력을 최대한 활용하려면 다음 단계를 따르세요.
추론 검토
Gemini의 추론 프로세스 (Google AI Studio UI에서 확인 가능), 작업을 분류한 방법, 출력에 대해 선택한 사항에 주의를 기울이세요. 이를 통해 어떤 근거로 해당 결과가 도출되었는지 파악할 수 있습니다.
추론에 관한 안내 제공
원하는 출력이 특히 긴 경우 프롬프트에 더 많은 안내를 제공하여 프롬프트에 표시된 작업을 완료하는 데 필요한 생각의 양을 제한하는 것이 좋습니다. 이렇게 하면 응답에 더 많은 토큰 출력을 예약할 수 있습니다.
예: 자세한 보고서의 경우 프롬프트에서 섹션과 하위 섹션을 지정하여 Gemini의 사고를 유도하고 결과물을 효율적으로 구성할 수 있습니다.
Generate a detailed report on the impact of climate change on coastal ecosystems. Report Structure: Introduction Rising Sea Levels 2.1. Impact on Mangroves 2.2. Impact on Coral Reefs Ocean Acidification 3.1. Impact on Shellfish 3.2. Impact on Plankton Extreme Weather Events 4.1. Increased Storm Intensity 4.2. Coastal Erosion Conclusion Provide detailed analysis within each section. <...> Climate Change Impact on Coastal Ecosystems: A Detailed Report 1. Introduction Coastal ecosystems, the dynamic interface between land and sea, are among the most productive and biodiverse environments on Earth. They provide crucial ecosystem services, including coastal protection, fisheries, carbon sequestration, and support for a vast array of species. However, these vital ecosystems are increasingly threatened by the multifaceted impacts of climate change. Driven by anthropogenic greenhouse gas emissions, climate change is manifesting in several interconnected ways that are profoundly altering the physical and chemical conditions of coastal environments. This report will delve into the key impacts of climate change on coastal ecosystems, focusing on rising sea levels, ocean acidification, and extreme weather events, and their specific consequences for critical habitats like mangroves and coral reefs, and key organisms like shellfish and plankton. Understanding these impacts is crucial for developing effective conservation and management strategies to mitigate the damage and enhance the resilience of these invaluable ecosystems. 2. Rising Sea Levels Rising sea levels are a direct and significant consequence of global warming, driven by thermal expansion of seawater and the melting of land ice (glaciers and ice sheets). This phenomenon poses a direct threat to low-lying coastal ecosystems by inundating habitats, altering salinity regimes, and increasing erosion rates. The rate of sea level rise is accelerating, and projections for the 21st century indicate a continued upward trend, with potentially devastating consequences for coastal environments. 2.1. Impact on Mangroves Mangrove forests are unique coastal ecosystems found in tropical and subtropical intertidal zones. They provide numerous benefits, including coastal protection against storms, nursery grounds for fish and invertebrates, and significant carbon sequestration... <...> |
기술 세부정보
- 사용 가능한 모델: Gemini 2.0 Flash Thinking, Gemini 2.5 Pro Experimental with thinking
- 멀티모달 입력: 텍스트, 오디오, 이미지, PDF (특정 모델 버전에 따라 지원 여부가 다를 수 있음)
- 출력: 텍스트 전용 (최종 응답)
- 토큰 입력: 긴 컨텍스트 (최대 100만 토큰까지 가능, 특정 모델 문서 참고)
- 토큰 출력: 최대 64,000개 토큰 (특정 모델 문서 참고)
참고: 총 토큰 출력에는 모델의 '생각'과 응답이 모두 포함됩니다. 요청의 복잡도와 도구 사용에 따라 최종 응답 출력의 최대 길이가 다를 수 있습니다.
이러한 기법을 구현하고 도구를 활용하면 Gemini 사고 모델을 통해 다양한 복잡한 작업을 처리하고 결과를 개선할 수 있습니다.
다음 단계
- 이제 프롬프트 설계의 개념을 이해했으므로 Google AI 스튜디오를 사용하여 직접 프롬프트를 작성해 보세요.
- 프롬프트 설계에 대한 자세한 내용은 프롬프트 전략 주제를 참고하세요.
- 멀티모달 프롬프트에 대한 자세한 내용은 미디어 파일로 프롬프트를 참고하세요.