Releases: OpenInterpreter/open-interpreter
The New Computer Update Part II
The New Computer Update Part II introduces our first local vision model, a suite of native Mac integrations, 5x launch speed, and dozens of other requested features.
Mac Control
The new calendar, contacts, browser, mail, and sms Computer API modules let Open Interpreter control your Mac's native applications.
For example, Open Interpreter can execute code that looks up a contact, then sends them a text and an email with an attachment.
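A minimal sketch of what such a script could look like (the module names come from this release, but the exact method names and signatures below are assumptions):

# Illustrative sketch only; the exact Computer API method names may differ.
number = computer.contacts.get_phone_number("Sarah")  # hypothetical lookup helper
computer.sms.send(number, "Sending the report to your inbox now.")  # text the contact
computer.mail.send("sarah@example.com", "Quarterly report", "Attached as promised.", ["~/Documents/report.pdf"])  # email with an attachment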
LLM-first Web Browser
The browser module lets Open Interpreter quickly browse the web by querying a web-enabled language model.
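For example, a model could issue a single call along these lines (a sketch; the exact function name and return value are assumptions):

result = computer.browser.search("What changed in Python 3.12 f-strings?")  # hypothetical query helper
print(result)  # the answer returned by the web-enabled model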
Point Model
We have developed a model called point, which is capable of locating visual controls precisely. It was designed to run locally on consumer hardware.
We leverage existing open-source models to "semantically search the screen" for text and icons. Language models can then call on this composite model to 'point' at text or icons on the screen:
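For instance, a sketch of such a call, reusing the mouse functions shown later in these notes (the prompts here are purely illustrative):

computer.mouse.click("Log in")  # finds the on-screen text "Log in" and clicks it
computer.mouse.move(icon="gear-shaped settings icon")  # hovers over an icon matching the description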
While this model is capable of understanding simple interfaces with discrete icons, we intend to explore more general solutions in the next few weeks.
And more:
- 5X launch speed
- Experimental Docker support
- Export conversation to a Jupyter notebook using %jupyter
- Experimental one-click installers for Mac, Linux, and Windows
- Profiles preview (a feature similar to custom GPTs)
…
- Added all docs and folders by @tyfiero in #873
- Add safety and docker support documentation by @tyfiero in #882
- Fix cli crash during screenshot generation by @double-thinker in #884
- Add documentation for local models by @tyfiero in #889
- Fix config edit mac os by @MikeBirdTech in #927
- Add reference to open-interpreter-termux in README by @MikeBirdTech in #906
- Fix feedback from input api key by @Notnaton in #932
- Fix azure openai coding error by @MonsterCoder in #924
- Fixed %tokens command by @CyanideByte in #896
- update documentation to include the parameters by @Notnaton in #908
- Fix: NameError: name 'computer' is not defined by @MikeBirdTech in #937
- Update ROADMAP.md by @eltociear in #861
- Fix small typo in README.md by @jameynakama in #918
- Update README.md by @codesmith-emmy in #929
- Update local model docs by @MikeBirdTech in #931
- Profiles~ by @Notnaton in #933
- Introduced a new documentation structure, added demos, and more by @tyfiero in #948
- create new cli entry point and update poetry script by @MikeBirdTech in #947
- Added settings by @tyfiero in #949
- Update All Settings page for Open Interpreter Docs by @tyfiero in #967
- Move the entry point to start_terminal_interface by @Notnaton in #960
- Try to get some error logs from Markdown() by @Notnaton in #969
- Add missing llm. by @Notnaton in #975
- Fix the conversation restoration problem in version 0.2.0 New Computer by @Steve235lab in #977
- Remove getpass add prompt_toolkit by @Notnaton in #991
- [WIP] Fix broken unit tests. by @Steve235lab in #995
- chore: update aifs by @shivenmian in #997
- added string to multiple python file generation for skill library by @akuten1298 in #996
- add docs to computer by @birbbit in #1007
- Option to pass paths to docs.search by @birbbit in #1008
- Load skills without overwriting them by @birbbit in #1009
- Litellm custom provider by @Notnaton in #955
- Jupyter export magic command by @tyfiero in #986
- Docs for magic commands updated by @CyanideByte in #1004
- Add doc strings to all computer API functions for the LLM to use in the skill library by @tyfiero in #1011
- Fix broken link in lm-studio.mdx by @jorgepvenegas in #1019
- Update README.md by @MikeBirdTech in #1017
- Boost performance by lazy loading modules by @tyfiero in #1020
- Quick fix to sync the computer api between processes by @tyfiero in #1022
- fix parsing of a function by @birbbit in #1024
- Make messages accessible to interpreter by @birbbit in #1025
- Correct punctuation in the document. by @Dragonchu in #1033
- ROADMAP.md by @matthewrygh in #1029
- Resolve #1034 Migrate from appdirs to platformdirs by @lombad in #1038
- Multi-line input support. by @Steve235lab in #1048
- Fixed local.py not working on Windows. Added GPU acceleration. by @CyanideByte in #1043
- Rename the app directory to open-intepreter, and migrate all files from older versions by @tyfiero in #1060
- Expand computer api with texting, mail, contacts, and calendar by @tyfiero in #1061
- Fix profile migration bug by @tyfiero in #1066
- fix some error when start command on macos by @982945902 in #1062
New Contributors
- @double-thinker made their first contribution in #884
- @MonsterCoder made their first contribution in #924
- @jameynakama made their first contribution in #918
- @codesmith-emmy made their first contribution in #929
- @Steve235lab made their first contribution in #977
- @shivenmian ❤️ made their first contribution in #997
- @akuten1298 made their first contribution in #996
- @birbbit made their first contribution in #1007
- @jorgepvenegas made their first contribution in #1019
- @Dragonchu made their first contribution in #1033
- @matthewrygh made their first contribution in #1029
- @lombad made their first contribution in #1038
- @982945902 made their first contribution in #1062
Full Changelog: v0.2.0...v0.2.1
The New Computer Update
This is the most significant upgrade to Open Interpreter since 0.1.0. Almost every component has been rewritten to support our project's objective: building a standard interface between language models and computers.
Note: This update is not backwards compatible. If you use Open Interpreter in Python, please read our migration guide.
1. The Computer API
In 2023, Open Interpreter started building the world's first real-time code execution environment for language models.
Now, we're creating an API for language models to use in that environment, starting with basic I/O controls like display, mouse, and keyboard:
computer.display.view() # Returns a screenshot to vision models
computer.mouse.click("On-screen text") # Locates text on-screen and clicks it
computer.mouse.move(icon="Description of icon") # Locates icon on-screen and hovers over it
computer.clipboard.view() # Returns the contents of the user's clipboard
# Full reference: https://docs.openinterpreter.com/computer-api/reference
We are also launching a free preview of the hosted tools that power computer.mouse at api.openinterpreter.com.
2. OS Mode
You can instruct Open Interpreter to use the Computer API to control your computer graphically:
interpreter --os
Even local vision models running via .llamafile, LM-Studio, or Jan.ai are supported.
3. LMC Messages
To support the incoming Language Model Computer architecture, the new messaging format that powers Open Interpreter extends OpenAI's messages format to include additional information, and a new role called computer:
[
  {
    "role": "assistant",
    "type": "code",
    "format": "python",
    "content": "plot = create_plot_from_data('data')\ndisplay_as_image(plot)\ndisplay_as_html(plot)"
  },
  {
    "role": "computer",
    "type": "image",
    "format": "base64.png",
    "content": "base64"
  },
  {
    "role": "assistant",
    "type": "message",
    "content": "Plot generated successfully."
  }
]
Read more about LMC Messages in the docs.
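In Python, conversations in this format can be restored and continued directly (a minimal sketch; the interpreter.messages attribute and chat() call are assumed from the 0.2.0 Python API):

from interpreter import interpreter

# Restore a prior LMC-format conversation, then continue it (sketch).
interpreter.messages = [
    {"role": "user", "type": "message", "content": "Plot data.csv as a line chart."}
]
interpreter.chat("Now save that plot as plot.png")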
And more...
In addition to these major updates, 0.2.0 comes with a suite of fixes and enhancements from our growing open-source community:
- Fixes crash UnboundLocalError active_block by @CyanideByte in #818
- Inserts dummy api key if missing by @CyanideByte in #808
- Fix README_JA.md by @tegnike in #810
- return empty dict if config file is empty by @sbendary25 in #811
- Add pyautogui mouse click functions and parameters by @Arrendy in #843
- Fixed the error when using Azure OpenAI API by @wsbao in #840
- Add package mismatches info to system_debug_info by @tegnike in #800
- Update keyboard control functions for better input handling by @Arrendy in #845
- implement mouse position function by @Arrendy in #850
- Fixed another a few bugs in using OpenAI API/Azure OpenAI API/OpenAI compatible custom API by @wsbao in #848
- Added new docs for litellm hosted models by @tyfiero in #858
- Added refreshed docs for terminal arguments and python streaming responses by @tyfiero in #864
- Add os docs by @tyfiero in #868
- Fixed the case where UnicodeDecodeError by @Moonlight-YS in #863
New Contributors
- @tegnike made their first contribution in #810
- @sbendary25 made their first contribution in #811
- @Arrendy made their first contribution in #843
- @wsbao made their first contribution in #840
- @tyfiero made their first contribution in #858
- @Moonlight-YS made their first contribution in #863
Full Changelog: v0.1.17...v0.2.0
This is only the beginning. Happy 2024.
The New Computer Update (Preview)
The New Computer Update is the most significant update to Open Interpreter since v0.1.0.
Ahead of its release, we're soft-launching this preview to get feedback on the update.
pip install --upgrade open-interpreter
interpreter --os
v0.1.17
Minor fix: gpt-4-1106-preview → gpt-4
We decided to make gpt-4 no longer default to gpt-4-1106-preview, which we've found performs worse on code interpretation tasks.
Important Note: This was not published from the repository. The last release, v0.1.16, was simply patched to not switch to gpt-4-1106-preview. The updates in the repo (and the commit associated with this release) are therefore out of sync with the binaries below.
We'll be back in sync by the next release, 0.2.0, the New Computer update.
v0.1.16
Critical fixes and cool new magic commands in this one.
- New magic commands: %% {insert shell code} for running custom shell code, and %info, added by @CyanideByte
- Significantly better debugging with more system information in errors, thank you @Notnaton!
- Create multiple agents / instances and watch them communicate in Python (see the sketch after this list)
- Azure now works with the config, thanks to @hargunmujral in #786
- @CyanideByte made it so if the user can't access gpt-4, we give a great error and let them switch to gpt-3.5
- @pratss10 added some helpful documentation (with screenshots!) for setting up gpt-4
- vision now works via the Python package
- Brand new German README, thanks to @iongpt!
- We're significantly more organized: one folder for the terminal_interface, another for core
- Much better shell output processing
- Finally, a proper ROADMAP!
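A rough sketch of the multi-instance idea mentioned above (the OpenInterpreter class name and attributes are assumptions drawn from later versions of the Python API, not necessarily the exact 0.1.16 interface):

from interpreter import OpenInterpreter  # assumption: instance class export as in later releases

agent_1 = OpenInterpreter()  # each instance keeps its own settings and history
agent_2 = OpenInterpreter()
agent_1.system_message = "You only ask questions about Python."

agent_1.chat("Write one question about generators, nothing else.")
question = agent_1.messages[-1]["content"]  # assumption: messages holds the transcript
agent_2.chat(f"Answer this question: {question}")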
Thank you to everyone that made it happen!
Cannot go without thanking the extraordinary @ericrallen for this release as well, who practically single-handedly brought our issues below 100 (from nearly 300 just a few weeks ago). Have a safe trip to Japan Eric!
New Contributors
- @joshuavial made their first contribution in #767
- @pratss10 made their first contribution in #773
- @hargunmujral made their first contribution in #786
- @iongpt made their first contribution in #738
Full Changelog: v0.1.15...v0.1.16
v0.1.15
Quick fixes to resolve some common issues:
- Fixed the missing flag issue in Python streaming responses (end_of_message wasn't always being emitted)
- Added more useful info to system info by @Notnaton in #750
- chore: popen using utf8 encoding by @fatelei in #742
- MOD: close subprocess and its file handles when reset by @coderfengyun in #476
New Contributors
- @fatelei made their first contribution in #742
- @coderfengyun made their first contribution in #476
Full Changelog: v0.1.14...v0.1.15
Vision I (Quick Fixes II)
- An issue with UNIX files has been resolved (#748)
- Experimental support for Python in --vision mode has been added
Full Changelog: v0.1.13...v0.1.14
Vision I (Quick Fixes I)
Quick fix for --vision support on Windows. File paths should now be properly recognized and loaded into the model.
Full Changelog: v0.1.12...v0.1.13
Vision I
A quick one, a fun one. Added experimental vision support for OpenAI users.
interpreter --vision
Drag files / screenshots into your terminal to use it. Also supports reflection for HTML. (It can see the designs it produces!)
Vision II will introduce support for reflective vision in many more languages.
What's Changed
- Get more system info in debug mode by @Notnaton in #701
- Quickfix .lower() by @Notnaton in #744
- Unsupported lang fix by @CyanideByte in #745
New Contributors
- @CyanideByte made their first contribution in #745
Full Changelog: v0.1.11...v0.1.12
Local II Update
- Local mode is now powered by LM Studio. Running --local will tell you how to set up LM Studio and connect to it automatically.
- It's way smaller. Removed the MASSIVE local embedding model, chromadb, oobabooga, and a bunch of other packages we didn't really need. Semgrep is now optional.
- The system message is tighter, so it's cheaper + faster on any LLM.
Several crashes have also been resolved, temperature is now properly set to 0 (which should increase performance on OpenAI models), PowerShell is now supported on Linux, an ugly print statement was removed, we're now enforcing a consistent code style (black, isort), and much more:
What's Changed
- Fix a crash in get_relevant_procedures_string by @Notnaton in #676
- Fixed issue with 3.5 Models hallucinating a function called "python", causing empty returns by @Cobular in #683
- Fixed Issue #661 by @shubhexists in #687
- Consider hiding the api-key by @Notnaton in #682
- Remove space from language by @Notnaton in #680
- Update README.md by @Shivam250702 in #677
- docs: fix typo in docs/WINDOWS.md by @suravshresth in #658
- Update MACOS.md by @ihgalis in #471
- Update README.md with ref to MACOS.md by @ihgalis in #472
- Clarity updates to README.md by @shruti222patel in #674
- fix: make sure safe_mode is disabled by default by @ericrallen in #705
- Update README.md to say Mistral 7B by @Maclean-D in #690
- fix: default to 0 if temperature is unset by @ericrallen in #710
- chore: add better docs for safe_mode; load semgrep if available by @ericrallen in #709
- chore: remove extraneous chroma package; clean up dependency list by @ericrallen in #704
- fix: make all local models run through the html entity buffer by @ericrallen in #703
- Fix GNU readline support for user input by @cowile in #522
- docs: fix markdown formatting; better explain %tokens by @ericrallen in #695
- fix: clarify that %tokens estimates token usage for next request by @ericrallen in #694
- Proposal: Enforce consistent code style with black, isort, and pre-commit by @ericrallen in #699
- Fix for failing Powershell code execution on Linux by @thefazzer in #729
- Update README.md for sample FastAPI by @AndrewNgo-ini in #727
- Update MACOS.md: fix command by @pjpjq in #725
- update Readme_ZH.md by @HashCookie in #712
New Contributors
- @Notnaton made their first contribution in #676
- @Cobular made their first contribution in #683
- @shubhexists made their first contribution in #687
- @Shivam250702 made their first contribution in #677
- @suravshresth made their first contribution in #658
- @shruti222patel made their first contribution in #674
- @cowile made their first contribution in #522
- @thefazzer made their first contribution in #729
- @AndrewNgo-ini made their first contribution in #727
- @pjpjq made their first contribution in #725
- @HashCookie made their first contribution in #712
Full Changelog: v0.1.10...v0.1.11
Great work everyone!