Instrucciones para modelos de pensamiento de Gemini

En esta guía, se describen algunas prácticas de uso para los modelos de pensamiento de Gemini. Estos modelos permiten a los desarrolladores abordar problemas complejos dividiéndolos en pasos más pequeños y fáciles de administrar. Este enfoque paso a paso no solo mejora el rendimiento en tareas desafiantes, sino que también proporciona a los desarrolladores estadísticas valiosas sobre el proceso de razonamiento de Gemini, que se puede ver en la IU de Google AI Studio. Actualmente, los modelos con capacidades de pensamiento incluyen Gemini 2.0 Flash Thinking y el experimental Gemini 2.5 Pro Experimental con pensamiento.

Técnicas de instrucción

Las instrucciones eficaces son fundamentales para maximizar los beneficios de los modelos de pensamiento de Gemini. Estas son algunas técnicas que debes tener en cuenta:

Cómo comenzar

Comienza con instrucciones generales que describan el resultado deseado. Comienza con una instrucción simple y observa las ideas iniciales del modelo sobre cómo abordar el desafío.

Describe the most unusual or striking architectural feature you've ever seen in a building or structure.
  
<...>
The most unusual architectural feature that has truly stayed with me is the "floating roof" of the
  Heydar Aliyev Center in Baku, Azerbaijan, designed by Zaha Hadid Architects.

Visually, it's breathtaking. Imagine a vast, undulating landscape of pure white, seemingly draped
over the building. It's not just a roof in the traditional sense; it's an extension of the ground,
flowing seamlessly upwards and over the building's various spaces before curving back down to meet
the earth again. There are no sharp angles, no rigid lines. Instead, everything is fluid, organic,
and continuous. It resembles a giant, pristine white wave frozen in motion, or perhaps a sand dune
sculpted by an unseen wind.
<...>
  

Proporciona instrucciones paso a paso

Si el intento inicial no produce el resultado deseado, define mejor la instrucción proporcionando instrucciones paso a paso más explícitas para guiar a Gemini a través del proceso de resolución de problemas y lograr un enfoque más enfocado. Esto te permite guiar la comprensión inicial del modelo de la tarea con sugerencias y orientación explícitas.

Now imagine that architectural feature is not just for looks, but has a specific function or
purpose related to the building's environment or the people who use it. Explain what that
function could be and how it enhances the building's overall design and usability.
  
<...>
Let's imagine that the "floating roof" of the Heydar Aliyev Center is not purely aesthetic but
serves a crucial environmental and user-centric function: it's designed as a sophisticated climate
moderator and natural ventilation system for the Baku environment.

Here's how this function would work and enhance the building:

Function: Passive Climate Control and Natural Ventilation System

Shading and Solar Heat Mitigation: The undulating curves of the white GFRC roof are strategically
shaped and oriented to provide optimal shading throughout the day, year-round. Baku experiences
intense sun, especially during the summer. The overhanging curves, acting like giant brise-soleils,
would block direct sunlight from hitting the building's glass facade, significantly reducing solar
heat gain. The varying heights and depths of the curves would create dynamic shade patterns, ensuring
that different parts of the building are shaded at different times of the day, optimizing comfort
and reducing the need for excessive air conditioning. The white color of the GFRC further enhances
this by reflecting a large portion of the solar radiation.
<...>
  

Instrucción de varias tomas con pensamiento

Las instrucciones de varias fotos en conjunto con el pensamiento pueden mejorar aún más la comprensión de Gemini y la precisión de sus respuestas. Proporciona varios ejemplos de pares de entrada-salida que demuestren el comportamiento y el formato de salida deseados en tu instrucción.

Example 1:
User: What is the tallest mountain in the world?
Assistant: Mount Everest

Example 2:
User: What is the largest ocean?
Assistant: Pacific Ocean

User: What is the longest river in the world?
Assistant:
  
Amazon River
  

Cómo definir el comportamiento y la salida

Cuando crees una aplicación en la que los usuarios interactuarán directamente con el modelo, te recomendamos que brindes orientación sobre cómo se verán y tendrán formato las respuestas y el resultado de Gemini.

Instrucciones del sistema

Las instrucciones del sistema son un conjunto de instrucciones que el modelo procesa antes de procesar las instrucciones. Se recuerdan cada vez que se le solicita al modelo y le indican cómo quieres que se comporte y responda. Por ejemplo, puedes incluir elementos como el rol o el arquetipo, la información contextual y las instrucciones de formato.

System Instruction: You are a helpful and concise coding tutor. Always provide code examples and explain them briefly.

User Prompt: Explain what is a closure in Javascript and give an example.
  
    function outerFunction(outerVar) {
      function innerFunction(innerVar) {
        console.log(outerVar); // Accessing outerVar from outerFunction's scope
        console.log(innerVar);
      }
      return innerFunction;
    }

    const closureExample = outerFunction('I am from outer');
    closureExample('I am from inner');

Explanation:

A closure is created when a function (innerFunction in this example) is defined inside another function
(outerFunction) and the inner function remembers and has access to the variables from the outer
function's scope, even after the outer function has finished executing.

In the example:

outerFunction is called with 'I am from outer' and returns innerFunction.

closureExample now holds a reference to innerFunction.

When closureExample('I am from inner') is called, innerFunction executes.

Even though outerFunction has already finished, innerFunction can still access and use outerVar
(which is 'I am from outer') because of the closure.

In essence, the innerFunction "closes over" the variables of its outer function's scope, creating
a closure.
  

Verificación y reflexión

Incluye pasos de verificación específicos en las instrucciones del sistema o en tus instrucciones directamente. Por ejemplo, pídele a Gemini que verifique sus fuentes, revise su razonamiento, identifique posibles errores y verifique su respuesta final.

    Verify the following statement and provide your sources: The capital of Brazil is Rio de Janeiro.
  
The statement "The capital of Brazil is Rio de Janeiro" is incorrect.

The current capital of Brazil is Brasília.

Rio de Janeiro was the capital of Brazil for a significant period, from 1763 to 1960. However, the
capital was moved to Brasília in 1960 as part of a plan to develop the country's interior and create
a more centrally located capital.

Verification and Sources:

You can easily verify this information from numerous reliable sources:

Encyclopedias:

Britannica: Searching for "Brazil" on Britannica will clearly state Brasília as the capital.

https://www.britannica.com/place/Brazil (Look for the "Capital" information in the introductory summary or "Geography" section)

Wikipedia: The Wikipedia article for "Brazil" also states Brasília as the capital.

https://en.wikipedia.org/wiki/Brazil (Look in the infobox or introductory section)

Government and Official Sources:

CIA World Factbook: This is a highly reliable source for country information. Searching for "Brazil"
will list Brasília as the capital.

https://www.cia.gov/the-world-factbook/countries/brazil/ (Look under "Government" > "Capital")

Official Website of the Brazilian Government (if accessible and in English): While navigating official
government websites can sometimes be complex, they would definitively list Brasília as the capital.

Geographic Resources:

National Geographic: Their website and publications will confirm Brasília as the capital of Brazil.

Atlases and Geography Textbooks: Any modern atlas or geography textbook will list Brasília as the capital.

Why the confusion?

The confusion might arise because:

Rio de Janeiro is a very famous Brazilian city: It's a major tourist destination, known for its
beaches, Carnival, and iconic landmarks like Christ the Redeemer. Its past status as capital adds
to its historical significance and public recognition.

Historical context: Rio de Janeiro was the capital for a long time. People who learned about Brazil
in the past might remember it as the capital.

In conclusion, the statement is false. The capital of Brazil is Brasília, not Rio de Janeiro. Rio
de Janeiro was the former capital but is no longer the current capital.

You can confidently use the sources listed above to verify this information.
  

Aprovecha las herramientas con el pensamiento

Los modelos de Gemini Thinking pueden aprovechar herramientas para realizar acciones más allá de la generación de texto. Esto les permite interactuar con sistemas externos, ejecutar código o acceder a información en tiempo real, e incorporar los resultados en su razonamiento y respuesta final. Puedes habilitar herramientas a través de la configuración de la API.

Herramienta de búsqueda

La herramienta de búsqueda permite que el modelo consulte motores de búsqueda externos para encontrar información actualizada o información más allá de sus datos de entrenamiento. Esto es útil para preguntas sobre eventos recientes o temas muy específicos.

What were the major scientific breakthroughs announced last week?
  
Based on recent search results, here are some highlights from the past week in science:

*   **[Specific Breakthrough 1]:** [Brief description based on search results, e.g., A new study published in Nature details advancements in battery technology...]
*   **[Specific Breakthrough 2]:** [Brief description based on search results, e.g., Researchers at MIT announced progress in understanding dark matter...]
*   **[Specific Breakthrough 3]:** [Brief description based on search results, e.g., The James Webb Space Telescope captured new images revealing...]

(Note: The model would use the search tool to find relevant news and articles from the specified timeframe and synthesize the information. Grounding metadata citing the sources would also typically be available via the API.)
  

Herramienta de ejecución de código

La herramienta de ejecución de código permite que el modelo escriba y ejecute código (p.ej., Python) para realizar cálculos, manipular datos o resolver problemas que se manejan mejor de forma algorítmica. El modelo recibe el resultado del código y puede usarlo en su respuesta.

What is the sum of the first 50 prime numbers?
Generate and run Python code for the calculation, and make sure you get all 50.
Provide the final sum clearly.
  
Okay, I can calculate that. I will generate Python code to find the first 50 prime numbers and sum them up.

```python
def is_prime(num):
    if num < 2:
        return False
    for i in range(2, int(num**0.5) + 1):
        if num % i == 0:
            return False
    return True

primes = []
num = 2
while len(primes) < 50:
    if is_prime(num):
        primes.append(num)
    num += 1

total_sum = sum(primes)
print(total_sum)
```

Running the code...

Code Execution Result:
Outcome: OK
Output:

5117
Use code with caution.
The sum of the first 50 prime numbers is 5117.

Resultados estructurados

Puedes indicarle al modelo que les otorgue un formato a sus resultados según una estructura específica, como JSON. Esto es especialmente útil para integrar el resultado del modelo en las aplicaciones. Puedes especificar el formato y el esquema deseados directamente en la instrucción o usar parámetros de API, como "response_mime_type", para obtener un control más sólido cuando esté disponible.

Provide a list of 3 famous physicists and their key contributions
in JSON format.
Use this JSON schema:

Physicist = {'name': str, 'contribution': str, 'era': str}
Return: list[Physicist]
```json
[
{
"name": "Albert Einstein",
"contribution": "Theory of Relativity (Special and General)",
"era": "Early 20th Century"
},
{
"name": "Isaac Newton",
"contribution": "Laws of Motion and Universal Gravitation",
"era": "Late 17th / Early 18th Century"
},
{
"name": "Marie Curie",
"contribution": "Pioneering research on radioactivity",
"era": "Late 19th / Early 20th Century"
}
]
```

Depuración y dirección

Google AI Studio proporciona a los desarrolladores acceso a los pensamientos del modelo en la IU. Esto es útil para que un desarrollador revise tu instrucción cuando realice la depuración:

Analiza el proceso de pensamiento

Revisa cuidadosamente el proceso de pensamiento de Gemini para identificar las áreas en las que podría haberse desviado. Este análisis puede proporcionar pistas valiosas para definir mejor tus instrucciones y obtener la respuesta esperada de forma coherente en tu aplicación.

Situación de ejemplo: Si el proceso de pensamiento de Gemini muestra que no entendió un paso en una instrucción compleja, puedes definir mejor la instrucción para aclarar ese paso específico.

Step 1: Summarize the following article about quantum physics.
Step 2: Translate the summary into Spanish.
Step 3: Extract the key entities from the Spanish summary.

Article: ... [Article Text] ...
The user wants me to process an article about Google's new quantum chip called Willow in three steps:
Step 1: Summarize the article.
Step 2: Translate the summary into Spanish.
Step 3: Extract the key entities from the Spanish summary.

Let's break down each step.

Step 1: Summarize the article.
I need to read the article and identify the main points. The article is about Google's new quantum
chip, Willow, and its achievements. The key achievements are:

Exponential error reduction as qubits scale up.

Outperforming supercomputers in a benchmark computation (RCS).

Willow is a step towards commercially relevant quantum computing.

It's the first system below threshold for quantum error correction.

It achieved a benchmark computation in under 5 minutes that would take a supercomputer 10 septillion years.

It was fabricated in a new facility.

It has state-of-the-art performance and high quality qubits.

Google is looking towards real-world applications with Willow.

Quantum computing will benefit AI.

I will synthesize these points into a concise summary.

Step 2: Translate the summary into Spanish.
Once I have the summary in English, I will translate it into Spanish. I need to ensure the translation
is accurate and natural-sounding in Spanish.

Step 3: Extract the key entities from the Spanish summary.
After translating the summary, I will identify and extract the key entities. Key entities are typically
nouns or noun phrases that represent important objects, concepts, or actors. In this context, they
will likely include:

Names of chips (Willow, Sycamore)

Companies/Organizations (Google Quantum AI, Google Research)

Technologies (quantum chip, quantum computer, qubits, quantum error correction, superconducting
quantum system, classical computers, supercomputers, AI)

Benchmarks (random circuit sampling, RCS)

People (Julian Kelly, Peter Shor, David Deutsch, Sergio Boixo, Hartmut Neven, John Preskill)

Concepts (exponential error reduction, below threshold, beyond breakeven, state-of-the-art performance,
commercially relevant applications, quantum mechanics, multiverse, T1 times, system benchmarks,
algorithmic benchmarks)

I will need to carefully review the Spanish summary to identify these entities and list them out.

Provide Corrective Feedback

If you observe errors or inconsistencies in Gemini's reasoning, provide corrective feedback in your prompt and/or System Instructions to anticipate where the model's response may deviate from your desired outcome.

You are an expert in botany. When asked to identify a plant, focus on leaf shape, flower structure, and stem characteristics.

Identify this plant based on these characteristics: ... [Plant Characteristics Description] ...

Previous incorrect response focused on habitat only. Please disregard habitat and focus on morphology.
Previous incorrect response focused on habitat only. Please disregard habitat and focus on morphology.
Tabebuia vellosoi

Aprovecha al máximo los resultados largos y el pensamiento de formato largo

El pensamiento genera “pensamientos” como parte del resultado de token que muestra el razonamiento detallado de Gemini. Para aprovechar al máximo estos resultados, haz lo siguiente:

Revisa el razonamiento

Presta especial atención al proceso de razonamiento de Gemini (disponible en la IU de Google AI Studio), cómo desglosó la tarea y las opciones que tomó para tu resultado. Esto puede ayudarte a comprender cómo llegó a su conclusión.

Proporciona orientación en el razonamiento

Si el resultado que deseas es particularmente extenso, te recomendamos que brindes más orientación en la consigna para limitar la cantidad de pensamiento necesario para completar la tarea que se presenta en ella. Esto te permite reservar más de la salida de token para tu respuesta.

Ejemplo: Para obtener un informe detallado, puedes especificar secciones y sub secciones en tu instrucción para guiar el pensamiento de Gemini y estructurar el resultado de manera eficiente.

Generate a detailed report on the impact of climate change on coastal ecosystems.

Report Structure:

Introduction

Rising Sea Levels
2.1. Impact on Mangroves
2.2. Impact on Coral Reefs

Ocean Acidification
3.1. Impact on Shellfish
3.2. Impact on Plankton

Extreme Weather Events
4.1. Increased Storm Intensity
4.2. Coastal Erosion

Conclusion

Provide detailed analysis within each section.
<...>
Climate Change Impact on Coastal Ecosystems: A Detailed Report

1. Introduction

Coastal ecosystems, the dynamic interface between land and sea, are among the most productive and
biodiverse environments on Earth. They provide crucial ecosystem services, including coastal protection,
fisheries, carbon sequestration, and support for a vast array of species. However, these vital
ecosystems are increasingly threatened by the multifaceted impacts of climate change. Driven by
anthropogenic greenhouse gas emissions, climate change is manifesting in several interconnected ways
that are profoundly altering the physical and chemical conditions of coastal environments. This report
will delve into the key impacts of climate change on coastal ecosystems, focusing on rising sea levels,
ocean acidification, and extreme weather events, and their specific consequences for critical habitats
like mangroves and coral reefs, and key organisms like shellfish and plankton. Understanding these
impacts is crucial for developing effective conservation and management strategies to mitigate the
damage and enhance the resilience of these invaluable ecosystems.

2. Rising Sea Levels

Rising sea levels are a direct and significant consequence of global warming, driven by thermal
expansion of seawater and the melting of land ice (glaciers and ice sheets). This phenomenon poses a
direct threat to low-lying coastal ecosystems by inundating habitats, altering salinity regimes, and
increasing erosion rates. The rate of sea level rise is accelerating, and projections for the 21st
century indicate a continued upward trend, with potentially devastating consequences for coastal
environments.

2.1. Impact on Mangroves

Mangrove forests are unique coastal ecosystems found in tropical and subtropical intertidal zones.
They provide numerous benefits, including coastal protection against storms, nursery grounds for
fish and invertebrates, and significant carbon sequestration...
<...>

Detalles técnicos

  • Modelos disponibles: Gemini 2.0 Flash Thinking y Gemini 2.5 Pro Experimental con Thinking
  • Entrada multimodal: Texto, audio, imágenes y archivos PDF (la compatibilidad puede variar según la versión específica del modelo)
  • Resultado: Solo texto (respuesta final)
  • Entrada de tokens: Contexto ampliado, potencialmente hasta 1 millón de tokens (consulta la documentación del modelo específico)
  • Resultado de tokens: Hasta 64,000 tokens (consulta la documentación específica del modelo)

NOTA: El resultado total de tokens incluye los "pensamientos" del modelo y la respuesta. Según la complejidad de tu solicitud y el uso de herramientas, la longitud máxima de la salida de la respuesta final puede variar.

Si implementas estas técnicas y aprovechas las herramientas, los modelos de pensamiento de Gemini pueden ayudarte a abordar una variedad de tareas complejas y pueden mejorar los resultados.

Próximos pasos