baoyu-danger-gemini-web

by JimLiu

baoyu-danger-gemini-web is a Gemini Web client wrapper for text, image generation, reference-image vision, and multi-turn chat. It runs from the local skill repo with bun or npx, uses scripts under `scripts/`, and requires a first-use consent check before install and usage.

Stars13.2k

Favorites0

Comments0

AddedApr 5, 2026

CategoryImage Generation

Install Command

npx skills add JimLiu/baoyu-skills --skill baoyu-danger-gemini-web

Curation Score

This skill scores 74/100, which means it is listable and likely useful for agents that need Gemini-backed text or image generation, but directory users should expect some setup and policy-sensitive handling. The repository shows real workflow substance: a substantial SKILL.md, explicit trigger phrases, consent gating for reverse-engineered API use, and a nontrivial TypeScript implementation with a CLI entry point and Gemini web client library.

74/100

Strengths

Good triggerability: the description clearly names use cases like Gemini text generation, image generation, vision/reference-image input, and multi-turn conversations.
Real operational substance: the repo includes a sizable scripts directory with main.ts plus a full gemini-webapi client, types, utilities, and error handling rather than a thin prompt wrapper.
Trust signal for risky workflow: SKILL.md explicitly requires a user consent check before first use for the reverse-engineered API.

Cautions

Adoption friction remains: SKILL.md has runtime resolution guidance but no direct install command, and the workflow likely depends on external environment/browser cookie setup that is only partially visible here.
This is a reverse-engineered 'danger' integration, so reliability and policy fit may be more fragile than official APIs.

Gemini Google Ai Cli TypeScript Node.js Workflow

Overview

Overview of baoyu-danger-gemini-web skill

What baoyu-danger-gemini-web skill does

The baoyu-danger-gemini-web skill is a Gemini Web client wrapper that lets an agent generate text, create images, use reference images, and continue multi-turn chats through a reverse-engineered web flow rather than an official API. Its real value is practical: if you need Gemini-style image generation or vision-capable prompting inside a skill workflow, this gives you a runnable local path instead of hand-writing fragile web requests.

Who should install it

Best fit: users who already work with local AI skills, are comfortable running bun or npx -y bun, and specifically want baoyu-danger-gemini-web for Image Generation or vision input support. It is less suitable for teams that need official vendor support, stable API contracts, or easy cloud deployment. The repo itself requires a consent check before first use, which is a strong signal that this is for informed, hands-on users.

Why people choose it over a plain prompt

A normal prompt cannot give you the transport layer, cookie handling, upload flow, token refresh behavior, or chat/image request structure. baoyu-danger-gemini-web skill packages those details in scripts/main.ts and the scripts/gemini-webapi/ client code. That reduces guesswork when the job is “use Gemini Web as a backend,” not merely “write a prompt that sounds good.”

Key adoption tradeoffs

The main differentiator is capability breadth: text, image generation, reference image input, chat continuity, and gem-related client support in one code path. The main tradeoff is obvious from the name and code: this depends on reverse-engineered Gemini Web behavior, so breakage, auth friction, rate limits, and temporary blocks are realistic risks. If that makes your use case unacceptable, do not install it.

How to Use baoyu-danger-gemini-web skill

Install context and runtime requirements

baoyu-danger-gemini-web install starts with the skill repo plus a JS runtime. The skill frontmatter expects either bun or npx. In practice, read these files first:

SKILL.md
scripts/package.json
scripts/main.ts
scripts/gemini-webapi/client.ts
scripts/gemini-webapi/constants.ts
scripts/gemini-webapi/exceptions.ts

The repo says all executable logic lives under scripts/. If bun is installed, use it; otherwise the skill explicitly supports npx -y bun. Do not skip the consent step described in SKILL.md, because first-use consent is part of the intended workflow.

What input baoyu-danger-gemini-web needs

For strong baoyu-danger-gemini-web usage, provide:

a clear task type: text, image generation, vision with reference image, or multi-turn chat
the actual user goal in one sentence
output constraints such as style, aspect, format, audience, or number of variants
any reference image path or prior conversation context
failure tolerance: whether you want one best try or iterative refinement

A weak request is: “Make an image with Gemini.”
A stronger request is: “Use baoyu-danger-gemini-web to generate 3 product-hero concepts for a matte black coffee grinder on a white seamless background, premium e-commerce style, front 3/4 angle, soft studio shadows, no text overlays.”

Turn a rough goal into a usable invocation

The skill works best when you transform intent into an execution-ready brief. A practical template:

State mode: image, text, or vision.
Define subject and purpose.
Add style and constraints.
Add references or chat history.
Say what success looks like.

Example prompt for baoyu-danger-gemini-web for Image Generation:
“Generate an image. Subject: futuristic hiking backpack for a landing page hero. Style: outdoor editorial photography, sunrise rim light, realistic materials. Constraints: no people, centered composition, 16:9, high detail, brand-safe. Return the best result plus a short revision note if the scene looks too cluttered.”

For reference-image use:
“Analyze the attached product photo and generate a cleaner marketing version that preserves silhouette, zipper placement, and color blocking, but improves lighting and background.”

Suggested workflow and debugging path

Start simple: text-only or one image request, then add references and chat continuity after the transport works. If something fails, inspect the likely layer in this order:

SKILL.md for consent and runtime rules
scripts/main.ts for CLI behavior
scripts/gemini-webapi/utils/load-browser-cookies.ts and cookie-file.ts for auth/cookie handling
get-access-token.ts and rotate-1psidts.ts for token refresh issues
exceptions.ts for likely failure classes such as AuthError, UsageLimitExceeded, TemporarilyBlocked, or ImageGenerationError

This repo is easier to adopt if you treat it like an operational connector, not a generic prompt pack.

baoyu-danger-gemini-web skill FAQ

Is baoyu-danger-gemini-web skill good for beginners?

Only if you are comfortable reading TypeScript and handling local runtime issues. The baoyu-danger-gemini-web guide is not hard conceptually, but it is not beginner-friendly in the “click once and go” sense. You may need to troubleshoot cookies, browser session state, or model behavior.

When is this better than official APIs or ordinary prompting?

Use baoyu-danger-gemini-web skill when you specifically need Gemini Web behaviors, image generation through this path, or a local skill that can combine text, images, and chat in one flow. Choose official APIs when you need supportability, contracts, clearer quotas, or production reliability. Choose ordinary prompting only when you do not need executable integration.

What are the main risks before install?

The biggest risks are auth breakage, service-side changes, usage limits, and temporary IP or account restrictions. The codebase already defines error cases for these, which is useful, but it also confirms they are expected realities rather than edge cases.

When should I avoid baoyu-danger-gemini-web for Image Generation?

Avoid it if your environment cannot use reverse-engineered web access, if your compliance team requires official endpoints only, or if your workflow depends on highly predictable uptime. It is also a poor fit if you only need basic text generation that an official SDK already handles well.

How to Improve baoyu-danger-gemini-web skill

Give richer task briefs, not just longer prompts

The fastest quality gain in baoyu-danger-gemini-web usage comes from structured inputs. For image tasks, specify subject, scene, composition, lighting, realism level, exclusions, and intended use. For vision tasks, name what must stay unchanged versus what can be redesigned. This reduces drift and makes first-pass outputs more usable.

Reduce common failure modes

Typical failures are vague prompts, missing reference constraints, and trying to do too much in one turn. Split work into stages:

first generate or analyze
then refine composition/style
then request final cleanup

If image output is off-brand, add explicit negatives like “no text, no watermark, no extra objects, no exaggerated stylization.” If chat context gets muddy, restate the current objective instead of assuming the session inferred it.

Use repo internals to troubleshoot smarter

To improve baoyu-danger-gemini-web skill results, read code where behavior actually lives. constants.ts reveals endpoints and model definitions. client.ts shows retry, timeout, upload, and parsing logic. exceptions.ts tells you how the tool classifies failure. That is more actionable than repeatedly rerunning the same prompt.

Iterate after the first output

Do not judge the skill on one pass. Ask for targeted revisions:

“keep composition, reduce background clutter”
“preserve product shape, improve material realism”
“same scene, but cleaner color palette and softer shadows”

This works better than “make it better,” because baoyu-danger-gemini-web is strongest when the objective and change request are both explicit.

Ratings & Reviews

No ratings yet

Share your review

0/10000

Latest reviews

Saving...

more skill

openclaw-persona-forge

by affaan-m

openclaw-persona-forge is a workflow-driven skill for building complete OpenClaw persona packages from scratch. It creates identity tension, SOUL.md-style framing, boundary rules, name options, and optional avatar prompt guidance. Best for OpenClaw character design, roleplay agents, and UI Design-adjacent persona work, not for minor edits to an existing persona.

UI Design

Favorites 0GitHub 156.2k

baoyu-imagine

by JimLiu

baoyu-imagine is a multi-provider image generation skill with a typed CLI, mandatory EXTEND.md setup, reference image support, aspect ratio controls, and batch runs across OpenAI, Azure OpenAI, Google, OpenRouter, DashScope, MiniMax, Jimeng, Seedream, and Replicate.

Image Generation

Favorites 0GitHub 13.2k

baoyu-comic

by JimLiu

baoyu-comic is a skill for turning source text into educational or biography-style comics with storyboard planning, character consistency, panel layouts, and staged image generation. It supports installable CLI usage, style and layout options, and partial workflows like --storyboard-only, --prompts-only, and --regenerate for controlled comic production.

Image Generation

Favorites 0GitHub 13.2k

shader-dev

by MiniMax-AI

shader-dev is a practical GLSL shader skill for ShaderToy-style real-time visuals. Use the shader-dev skill to build or debug ray marching, SDF scenes, lighting, particles, fluid motion, post-processing, and shader-dev for UI Design with less guesswork than a generic prompt.

UI Design

Favorites 0GitHub 11.7k

videoagent-video-studio

by pexoai

videoagent-video-studio is a skill for generating short AI videos from text, images, and references. Use it to test text-to-video and image-to-video workflows, compare supported models, and run the hosted proxy or self-hosted setup with Node 18+.

Video Editing

Favorites 0GitHub 456

seo-image-gen

by AgriciDaniel

seo-image-gen is a GitHub skill for turning SEO image requests into production-ready prompts and settings for OG images, social previews, hero banners, product visuals, infographics, and thumbnails. It uses Gemini via nanobanana-mcp and assumes the banana extension is installed for a practical seo-image-gen guide and workflow.

Image Generation

Favorites 0GitHub 0

baoyu-xhs-images

by JimLiu

baoyu-xhs-images turns articles or notes into Xiaohongshu infographic carousels with presets, styles, layouts, and setup guidance. It helps users install the skill, choose inputs, and run `/baoyu-xhs-images` for structured multi-slide social posts.

UI Design

Favorites 0GitHub 13.2k

baoyu-cover-image

by JimLiu

baoyu-cover-image helps agents generate structured article cover-image prompts using type, palette, rendering, text, and mood. It supports 2.35:1, 16:9, and 1:1 formats, includes auto-selection rules and compatibility guidance, and fits repeatable editorial and UI Design cover workflows.

UI Design

Favorites 0GitHub 13.2k

meme-factory

by softaworks

meme-factory helps agents create template-based memes with the free memegen.link API, plus Markdown-friendly text memes. Use it to generate shareable meme URLs, pick fitting templates, format text correctly, and automate output with the included Python helper.

Image Generation

Favorites 0GitHub 1.3k

logo-creator

by ReScienceLab

logo-creator is an AI logo workflow for generating concepts, comparing variations, cropping, removing backgrounds, and exporting SVG assets. It uses style references, example prompts, and helper scripts for logo, icon, favicon, and brand mark creation in ReScienceLab/opc-skills.

Branding

Favorites 0GitHub 0

scientific-schematics

by K-Dense-AI

scientific-schematics turns natural-language prompts into publication-quality scientific diagrams with smart iterative refinement. It uses Nano Banana 2 for generation and Gemini 3.1 Pro Preview for review, regenerating only when output falls below the threshold for your document type. Built for neural network architectures, system diagrams, flowcharts, biological pathways, and other complex scientific visuals.

Image Generation

Favorites 0GitHub 0

banner-creator

by ReScienceLab

banner-creator helps create banners, headers, and hero images with a structured workflow: gather requirements, generate variations, refine with feedback, and crop to platform ratios using the included script.

UI Design

Favorites 0GitHub 0

baoyu-article-illustrator

by JimLiu

baoyu-article-illustrator helps agents turn article drafts into structured illustration prompts, placements, and consistent type-plus-style decisions for explainers, tutorials, diagrams, and multi-image posts.

Image Generation

Favorites 0GitHub 13.2k

nanobanana

by ReScienceLab

nanobanana is a Python CLI skill for Google Gemini 3 Pro Image that supports text-to-image, image editing, aspect ratios, 2K/4K output, and batch generation with simple local scripts.

Image Generation

Favorites 0GitHub 654

sound-fx

by NoizAI

Use the sound-fx skill to turn text prompts into sound effects, foley, ambient beds, creature sounds, and UI noises. It fits sound-fx for Audio Editing, quick prototyping, and downloadable audio assets. Install with NoizAI/skills, then use the script-based workflow with a valid Noiz API key. Not for speech, lyrics, melody, or voice cloning.

Audio Editing

Favorites 0GitHub 498

chat-with-anyone

by NoizAI

chat-with-anyone helps you clone a real person's voice from public audio or design a matching voice from an image, then generate synthetic replies with TTS. It supports practical workflows for roleplay, narration, and voice generation, with guidance on install, source selection, and safe usage.

Voice Generation

Favorites 0GitHub 498