<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic College Course Use - Sharing Data With Students in Administration &amp; Architecture</title>
    <link>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146723#M4796</link>
    <description>&lt;P&gt;Hello Everyone,&lt;/P&gt;&lt;P&gt;I am creating a new college course on Database design and SQL analytics and have decided to use Databricks as our platform in the course.&amp;nbsp; We are going to be using the Free Edition so students do not need to pay for access.&amp;nbsp; I'm wondering what solutions people have found for creating datasets and sharing them with students?&amp;nbsp; From what I can tell, the free edition limits sharing directly via emails and also limits Delta Shares.&amp;nbsp; Is my only option to export to .csv files and then have the students create their own tables using the .csv file?&lt;BR /&gt;&lt;BR /&gt;The same question goes for SQL editor scripts; I created some demos that I walked through in class but I would like to share the editor files directly.&amp;nbsp; Is that possible using the Free Edition?&amp;nbsp; My current work around is copying the SQL queries to a .txt file and the students copy &amp;amp; paste from the .txt into their own SQL editor.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hoping there might be some easier sharing opportunities that I'm missing in the Free Edition.&lt;/P&gt;</description>
    <pubDate>Tue, 03 Feb 2026 13:09:12 GMT</pubDate>
    <dc:creator>Drew_Prof</dc:creator>
    <dc:date>2026-02-03T13:09:12Z</dc:date>
    <item>
      <title>College Course Use - Sharing Data With Students</title>
      <link>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146723#M4796</link>
      <description>&lt;P&gt;Hello Everyone,&lt;/P&gt;&lt;P&gt;I am creating a new college course on Database design and SQL analytics and have decided to use Databricks as our platform in the course.&amp;nbsp; We are going to be using the Free Edition so students do not need to pay for access.&amp;nbsp; I'm wondering what solutions people have found for creating datasets and sharing them with students?&amp;nbsp; From what I can tell, the free edition limits sharing directly via emails and also limits Delta Shares.&amp;nbsp; Is my only option to export to .csv files and then have the students create their own tables using the .csv file?&lt;BR /&gt;&lt;BR /&gt;The same question goes for SQL editor scripts; I created some demos that I walked through in class but I would like to share the editor files directly.&amp;nbsp; Is that possible using the Free Edition?&amp;nbsp; My current work around is copying the SQL queries to a .txt file and the students copy &amp;amp; paste from the .txt into their own SQL editor.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hoping there might be some easier sharing opportunities that I'm missing in the Free Edition.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2026 13:09:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146723#M4796</guid>
      <dc:creator>Drew_Prof</dc:creator>
      <dc:date>2026-02-03T13:09:12Z</dc:date>
    </item>
    <item>
      <title>Re: College Course Use - Sharing Data With Students</title>
      <link>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146741#M4797</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/214306"&gt;@Drew_Prof&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Short answer: With Databricks Free Edition, you can’t act as a Delta Sharing provider or use Marketplace to distribute data, and you don’t have access to account-level sharing features. Instead, the most reliable path is to distribute files (CSV/Parquet) and have students load them into their own workspaces using Unity Catalog volumes; for SQL, share .sql files or notebooks via a public Git repo or simple file upload/import. This keeps each student within their own Free Edition workspace and avoids quota/contention issues.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Recommended patterns that work well for a class&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Datasets (tables) &lt;STRONG&gt;Option A&lt;/STRONG&gt; — Distribute files; students load into their own volume (recommended)&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;You publish small-to-moderate CSV/Parquet files through your LMS or a public link (GitHub release, course site).&lt;/LI&gt;
&lt;LI&gt;Students upload the files to a Unity Catalog volume in their own Free Edition workspace (Catalog &amp;gt; Volumes &amp;gt; Upload). Free Edition supports volumes; DBFS root is restricted.&lt;/LI&gt;
&lt;LI&gt;Students create tables over those files. Example:&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;SQL -- one-time setup CREATE CATALOG IF NOT EXISTS workspace; CREATE SCHEMA IF NOT EXISTS workspace.default; CREATE VOLUME IF NOT EXISTS workspace.default.course_data;&lt;/P&gt;
&lt;P&gt;-- after uploading orders.csv to the volume: CREATE TABLE IF NOT EXISTS workspace.default.orders USING CSV OPTIONS (header true, inferSchema true) LOCATION '/Volumes/workspace/default/course_data/orders.csv';&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Option B&lt;/STRONG&gt; — Provide a “bootstrap” notebook or SQL file&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Ship a small notebook or .sql file that: (1) creates the volume, (2) gives students a step to upload files, (3) executes the CREATE TABLE commands. This minimizes copy/paste errors and standardizes table names.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Notes&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Favor Parquet where possible to cut file size and speed up loads (especially useful under small-warehouse limits).&lt;/LI&gt;
&lt;LI&gt;Avoid relying on external HTTP downloads from within the workspace; Free Edition outbound access is allowlisted and may not include arbitrary hosts.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;SQL editor scripts and teaching materials &lt;STRONG&gt;Option C&lt;/STRONG&gt; — Public Git repository&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Put .sql files and notebooks in a public GitHub repo.&lt;/LI&gt;
&lt;LI&gt;Students either:
&lt;UL&gt;
&lt;LI&gt;Use Git folders (if enabled for their Free Edition workspace) to clone the repo; or&lt;/LI&gt;
&lt;LI&gt;Download files from GitHub and use “Upload” in the Databricks Workspace or SQL Editor Files to import .sql or notebooks.&lt;BR /&gt;This is the simplest way to share SQL editor content without depending on workspace invites. (Git folders are generally available in Databricks; if they’re not visible in a student’s Free Edition workspace, file upload still works.)&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;STRONG&gt;Option D&lt;/STRONG&gt; — Export/import notebooks (.dbc or source)&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Export notebooks as .dbc or source files and post them to the LMS.&lt;/LI&gt;
&lt;LI&gt;Students import via Workspace &amp;gt; Import; then they can open the SQL cells in the editor.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;What not to rely on in Free Edition&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Delta Sharing as a provider or Marketplace distribution: provider objects are created at the account/metastore layer and Free Edition does not expose the account console/APIs; Marketplace provider access is explicitly disallowed.&lt;/LI&gt;
&lt;LI&gt;Single shared instructor workspace for the whole class: one tiny SQL warehouse plus fair‑use quotas will bottleneck and may shut compute down for the day if exceeded.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;If you really want “in‑platform” collaboration&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;You can add a small number of collaborators to a single workspace and co‑edit notebooks/SQL files in real‑time, but keep groups small and time‑boxed to avoid quotas.&lt;/LI&gt;
&lt;LI&gt;For larger cohorts, stick with each student’s own Free Edition workspace + file/Git distribution.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Quick starter checklist you can reuse in your syllabus&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;Provide download links for datasets (CSV/Parquet) and a bootstrap SQL file/notebook that:
&lt;OL&gt;
&lt;LI&gt;Creates catalog/schema/volume&lt;/LI&gt;
&lt;LI&gt;Instructs students to upload files&lt;/LI&gt;
&lt;LI&gt;Runs CREATE TABLE … USING CSV/Parquet LOCATION '/Volumes/…'&lt;/LI&gt;
&lt;/OL&gt;
&lt;/LI&gt;
&lt;LI&gt;Host all SQL editor examples in a public GitHub repo as .sql files; add a README with “Upload into SQL Editor Files” instructions.&lt;/LI&gt;
&lt;LI&gt;Keep file sizes modest and table counts reasonable to respect Free Edition limits.&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;Hope this helps, Louis.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2026 15:59:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146741#M4797</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2026-02-03T15:59:06Z</dc:date>
    </item>
    <item>
      <title>Re: College Course Use - Sharing Data With Students</title>
      <link>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146742#M4798</link>
      <description>&lt;P class="wnfdnte" data-pm-slice="1 1 []"&gt;One other point and, quick win for course datasets on Free Edition.&lt;/P&gt;
&lt;P class="wnfdnte"&gt;Databricks Labs has a purpose‑built synthetic data toolkit: &lt;STRONG&gt;dbldatagen&lt;/STRONG&gt; (Databricks Labs Data Generator). It’s open source and runs great on Free Edition with a simple notebook‑scoped install.&lt;/P&gt;
&lt;UL dir="auto"&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Install:&lt;/P&gt;
&lt;UL dir="auto"&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;In a notebook cell: &lt;CODE&gt;%pip install dbldatagen&lt;/CODE&gt;.&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Links:&lt;/P&gt;
&lt;UL dir="auto"&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;GitHub: &lt;A href="https://github.com/databrickslabs/dbldatagen" target="_blank" rel="noopener noreferrer nofollow"&gt;https://github.com/databrickslabs/dbldatagen&lt;/A&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Docs: &lt;A href="https://databrickslabs.github.io/dbldatagen/" target="_blank" rel="noopener noreferrer nofollow"&gt;https://databrickslabs.github.io/dbldatagen/&lt;/A&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;UL dir="auto"&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Works out of the box on Databricks runtimes and Community/Free Edition via &lt;CODE&gt;%pip&lt;/CODE&gt; (no special cluster libs).&lt;/P&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;No extra Python deps beyond what the Databricks runtime already includes for supported runtimes.&lt;/P&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;You can expose the generated DataFrame as a view and consume from other languages (SQL, Scala, R).&lt;/P&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Comes with plug‑in style &lt;STRONG&gt;standard datasets&lt;/STRONG&gt; to jump‑start common examples.&lt;/P&gt;
&lt;/LI&gt;
&lt;LI dir="auto"&gt;
&lt;P class="wnfdnte"&gt;Supports &lt;STRONG&gt;multi‑table generation&lt;/STRONG&gt; with cross‑references — perfect for relational concepts (FKs, dimensions/facts).&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P class="wnfdnte"&gt;Copy/paste starter&lt;/P&gt;
&lt;PRE class="_1ibi0s3bv" dir="auto"&gt;&lt;CODE class="language-python"&gt;%pip install dbldatagen

import dbldatagen as dg

dataspec = (
    dg.DataGenerator(spark, name="customers", rows=10_000)
      .withColumn("customer_id", "int", minValue=1, maxValue=10_000)
      .withColumn("name", "string", template=r"\w \w")
      .withColumn("email", "string", template=r"\w@\w.com")
      .withColumn("signup_date", "date", begin="2020-01-01", end="2024-12-31")
)

df = dataspec.build()
df.write.saveAsTable("customers")
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P class="wnfdnte"&gt;&amp;nbsp;&lt;/P&gt;
&lt;P class="wnfdnte"&gt;Cheers, Lou&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2026 16:01:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146742#M4798</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2026-02-03T16:01:43Z</dc:date>
    </item>
    <item>
      <title>Re: College Course Use - Sharing Data With Students</title>
      <link>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146745#M4799</link>
      <description>&lt;P&gt;Excellent information, thank you!&amp;nbsp; The data generator was something I was not aware of so I will check that out.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Feb 2026 16:31:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/administration-architecture/college-course-use-sharing-data-with-students/m-p/146745#M4799</guid>
      <dc:creator>Drew_Prof</dc:creator>
      <dc:date>2026-02-03T16:31:46Z</dc:date>
    </item>
  </channel>
</rss>

