gemini_audio

This function sends audio to the Gemini API and returns a text description.

Provides an interface to the Google Gemini API,
enabling users to call Gemini Large Language Model (LLM) features directly from R,
including language processing, text generation, and other AI-driven capabilities.
For more information, please visit <https://ai.google.dev/docs/gemini_api_overview>.

Jinhwan Kim

gemini.R

Interface for 'Google Gemini' API

Maciej Nasinski

gemini_audio function

<dl><dt>audio</dt>
<dd>Path to the audio file (default: uses a sample file). Must be an MP3.</dd>
<dt>prompt</dt>
<dd>A string describing what to do with the audio.</dd>
<dt>model</dt>
<dd>The Gemini model to use ("1.5-flash", "1.5-pro", or "2.0-flash-exp"). Defaults to "1.5-flash".</dd>
<dt>temperature</dt>
<dd>Controls the randomness of the generated text (0-2). Defaults to 0.5.</dd>
<dt>maxOutputTokens</dt>
<dd>The maximum number of tokens in the generated text. Defaults to 1024.</dd></dl>
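
The arguments above can be exercised with a short call. The following is a minimal sketch, not the package's official example: it assumes gemini.R is installed and an API key has already been configured, and that the result can be printed as text. The file name `speech.mp3`, the `run_live` flag, and the helper `validate_audio_args()` are hypothetical and are not part of gemini.R; the helper only mirrors the constraints documented above (MP3 input, temperature between 0 and 2).

```r
# Hypothetical helper (not part of gemini.R): checks the documented
# constraints before any request is sent.
validate_audio_args <- function(audio, temperature = 0.5, maxOutputTokens = 1024) {
  # The documentation requires an MP3; only the file extension is checked here.
  if (!grepl("\\.mp3$", audio, ignore.case = TRUE)) {
    stop("audio must be a path to an .mp3 file")
  }
  # temperature is documented as ranging from 0 to 2.
  if (!is.numeric(temperature) || temperature < 0 || temperature > 2) {
    stop("temperature must be between 0 and 2")
  }
  if (!is.numeric(maxOutputTokens) || maxOutputTokens < 1) {
    stop("maxOutputTokens must be a positive number")
  }
  invisible(TRUE)
}

# Assumption: set to TRUE only after gemini.R is installed and an API key is set.
run_live <- FALSE

if (run_live) {
  library(gemini.R)

  audio_path <- "speech.mp3"  # hypothetical file name
  validate_audio_args(audio_path, temperature = 0.5, maxOutputTokens = 1024)

  result <- gemini_audio(
    audio = audio_path,
    prompt = "Summarize what is said in this recording.",
    model = "1.5-flash",
    temperature = 0.5,
    maxOutputTokens = 1024
  )
  print(result)
}
```

Validating locally first avoids spending a request on input the documentation already rules out, such as a non-MP3 file.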

Arguments

Analyze audio using Gemini — gemini_audio

